MORAXELLA CATARRHALIS BASB034 POLYPEPTIDES AND USES THEREOF

This invention relates to polynucleotides (herein referred to as "BASB034
polynucleotide(s)"), polypeptides encoded by them (referred to herein as "BASB034" or
"BASB034 polypeptide(s)"), recombinant materials and methods for their production. In
another aspect, the invention relates to methods for using such polypeptides and 
polynucleotides, including vaccines against bacterial infections. In a further aspect, the 
invention relates to diagnostic assays for detecting infection of certain pathogens. 

BACKGROUND OF THE INVENTION

Moraxella catarrhalis (also named Branhamella catarrhalis) is a Gram-negative bacterium
frequently isolated from the human upper respiratory tract. It is responsible for several
pathologies, the main ones being otitis media in infants and children, and pneumonia in
the elderly. It is also responsible for sinusitis, nosocomial infections and, less frequently,
invasive diseases.

Otitis media is an important childhood disease both by the number of cases and its potential
sequelae. More than 3.5 million cases are recorded every year in the United States, and it is
estimated that 80% of children have experienced at least one episode of otitis before
reaching the age of 3 (Klein, JO (1994) Clin.Inf.Dis. 19:823). Left untreated, or becoming
chronic, this disease may lead to hearing loss that may be temporary (in the case of fluid
accumulation in the middle ear) or permanent (if the auditory nerve is damaged). In infants,
such hearing loss may be responsible for delayed speech learning.






Three bacterial species are primarily isolated from the middle ear of children with otitis
media: Streptococcus pneumoniae, non-typeable Haemophilus influenzae (NTHi) and M.
catarrhalis. They are present in 60 to 90% of cases. A review of recent studies shows
that S. pneumoniae and NTHi each account for about 30%, and M. catarrhalis for about 15%, of
the otitis media cases (Murphy, TF (1996) Microbiol.Rev. 60:267). Other bacteria can be
isolated from the middle ear (H. influenzae type B, S. pyogenes, etc.) but at a much lower
frequency (2% of the cases or less).

Epidemiological data indicate that, for the pathogens found in the middle ear, the
colonization of the upper respiratory tract is an absolute prerequisite for the development of
an otitis; other factors are, however, also required to lead to the disease (Dickinson, DP et al. (1988)
J. Infect.Dis. 158:205, Faden, HL et al. (1991) Ann.Otorhinol.Laryngol. 100:612). These are
important to trigger the migration of the bacteria into the middle ear via the Eustachian
tubes, followed by the initiation of an inflammatory process. These factors are unknown
to date. It has been postulated that a transient anomaly of the immune system following a
viral infection, for example, could cause an inability to control the colonization of the
respiratory tract (Faden, HL et al. (1994) J. Infect.Dis. 169:1312). An alternative explanation
is that exposure to environmental factors allows a heavier colonization of some
children, who subsequently become susceptible to the development of otitis media because
of the sustained presence of middle ear pathogens (Murphy, TF (1996) Microbiol.Rev.
60:267).


The immune response to M. catarrhalis is poorly characterized. The analysis of strains
isolated sequentially from the nasopharynx of babies followed from 0 to 2 years of age
indicates that they frequently acquire and eliminate new strains. This indicates that an
efficacious immune response against this bacterium is mounted by the colonized children
(Faden, HL et al. (1994) J. Infect.Dis. 169:1312).

In most adults tested, bactericidal antibodies have been identified (Chapman, AJ et al.
(1985) J. Infect.Dis. 151:878). Strains of M. catarrhalis vary in their capacity
to resist serum bactericidal activity: in general, isolates from diseased individuals are more
resistant than those from individuals who are simply colonized (Hol, C et al. (1993) Lancet 341:1281; Jordan,
KL et al. (1990) Am.J.Med. 88 (suppl. 5A):28S). Serum resistance could therefore be
considered a virulence factor of the bacterium. An opsonizing activity has been observed in
the sera of children recovering from otitis media.

The antigens targeted by these different immune responses in humans have not been
identified, with the exception of OMP B1, an 84 kDa protein whose expression is regulated
by iron and that is recognized by the sera of patients with pneumonia (Sethi, S et al. (1995)
Infect.Immun. 63:1516), and of UspA1 and UspA2 (Chen, D et al. (1999) Infect.Immun.
67:1310).

A few other membrane proteins present on the surface of M. catarrhalis have been
characterized using biochemical methods, or for their potential implication in the induction
of a protective immunity (for review, see Murphy, TF (1996) Microbiol.Rev. 60:267). In a
mouse pneumonia model, the presence of antibodies raised against some of them (UspA,
CopB) favors a faster clearance of the pulmonary infection. Another polypeptide (OMP CD)
is highly conserved among M. catarrhalis strains, and presents homologies with a porin of
Pseudomonas aeruginosa, which has been demonstrated to be efficacious against this bacterium
in animal models.

The frequency of Moraxella catarrhalis infections has risen dramatically in the past few
decades. This has been attributed to the emergence of multiply antibiotic resistant strains
and an increasing population of people with weakened immune systems. It is no longer
uncommon to isolate Moraxella catarrhalis strains that are resistant to some or all of the
standard antibiotics. This phenomenon has created an unmet medical need and demand for
new anti-microbial agents, vaccines, drug screening methods, and diagnostic tests for this
organism.

The present invention relates to BASB034, in particular BASB034 polypeptides and
BASB034 polynucleotides, recombinant materials and methods for their production. In
another aspect, the invention relates to methods for using such polypeptides and
polynucleotides, including prevention and treatment of microbial diseases, amongst others.
In a further aspect, the invention relates to diagnostic assays for detecting diseases
associated with microbial infections and conditions associated with such infections, such
as assays for detecting expression or activity of BASB034 polynucleotides or
polypeptides.

Various changes and modifications within the spirit and scope of the disclosed invention
will become readily apparent to those skilled in the art from reading the following 
descriptions and from reading the other parts of the present disclosure. 

DESCRIPTION OF THE INVENTION 

The invention relates to BASB034 polypeptides and polynucleotides as described in greater 
detail below. In particular, the invention relates to polypeptides and polynucleotides of 
BASB034 of Moraxella catarrhalis, which is related by amino acid sequence homology to 
Klebsiella pneumoniae outer membrane phospholipase A protein. The invention relates 
especially to BASB034 having the nucleotide and amino acid sequences set out in SEQ ID 
NO: 1, 3, 5 or 7 and SEQ ID NO:2, 4, 6 or 8 respectively. It is understood that sequences 
recited in the Sequence Listing below as "DNA" represent an exemplification of one 
embodiment of the invention, since those of ordinary skill will recognize that such 
sequences can be usefully employed in polynucleotides in general, including 
ribopolynucleotides. 

Polypeptides 

In one aspect of the invention there are provided polypeptides of Moraxella catarrhalis 
referred to herein as "BASB034" and "BASB034 polypeptides" as well as biologically,
diagnostically, prophylactically, clinically or therapeutically useful variants thereof, and
compositions comprising the same. 



The present invention further provides for: 

(a) an isolated polypeptide which comprises an amino acid sequence which has at least 
85% identity, preferably at least 90% identity, more preferably at least 95% identity, most 
preferably at least 97-99% or exact identity, to that of SEQ ID NO:2, 4, 6 or 8; 

(b) a polypeptide encoded by an isolated polynucleotide comprising a polynucleotide 
sequence which has at least 85% identity, preferably at least 90% identity, more 
preferably at least 95% identity, even more preferably at least 97-99% or exact identity to
SEQ ID NO:1, 3, 5 or 7 over the entire length of SEQ ID NO:1, 3, 5 or 7 respectively; or

(c) a polypeptide encoded by an isolated polynucleotide comprising a polynucleotide 
sequence encoding a polypeptide which has at least 85% identity, preferably at least 90% 
identity, more preferably at least 95% identity, even more preferably at least 97-99% or 
exact identity, to the amino acid sequence of SEQ ID NO:2, 4, 6 or 8. 
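
By way of illustration only, the identity thresholds recited in (a) to (c) can be checked computationally. The following Python sketch assumes a simple global (Needleman-Wunsch) alignment with unit match, mismatch and gap scores, and reports identity as matched positions divided by the entire length of the reference sequence. The function names, the scoring values and the short test peptides are hypothetical and are not taken from the Sequence Listing; where this specification defines a particular identity algorithm, that definition governs.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-1):
    """Needleman-Wunsch global alignment; returns the two gapped strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    i, j = n, m
    out_a, out_b = [], []
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(candidate, reference):
    """Identity of candidate to reference over the entire reference length."""
    if not reference:
        raise ValueError("reference sequence must not be empty")
    al_c, al_r = global_align(candidate, reference)
    matches = sum(1 for x, y in zip(al_c, al_r) if x == y and x != "-")
    return 100.0 * matches / len(reference)


def meets_threshold(candidate, reference, threshold=85.0):
    """True if the candidate reaches the given identity threshold."""
    return percent_identity(candidate, reference) >= threshold


# Hypothetical 8-residue peptides differing at one position: 7/8 = 87.5%.
reference = "MKTAYIAK"
candidate = "MKTAYIAR"
print(percent_identity(candidate, reference))       # 87.5
print(meets_threshold(candidate, reference, 85.0))  # True
print(meets_threshold(candidate, reference, 90.0))  # False
```

Dividing by the reference length, rather than by the alignment length, is one way of reading "over the entire length"; other conventions give slightly different percentages for sequences of unequal length.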

The BASB034 polypeptides provided in SEQ ID NO:2, 4, 6 or 8 are the BASB034 
polypeptides from Moraxella catarrhalis strains Mc2931 (ATCC 43617), Mc2908,
Mc2913 and Mc2969.

The invention also provides an immunogenic fragment of a BASB034 polypeptide, that
is, a contiguous portion of the BASB034 polypeptide which has the same or substantially
the same immunogenic activity as the polypeptide comprising the amino acid sequence of
SEQ ID NO:2, 4, 6 or 8. That is to say, the fragment (if necessary when coupled to a
carrier) is capable of raising an immune response which recognises the BASB034
polypeptide. Such an immunogenic fragment may include, for example, the BASB034
polypeptide lacking an N-terminal leader sequence, and/or a transmembrane domain
and/or a C-terminal anchor domain. In a preferred aspect the immunogenic fragment of
BASB034 according to the invention comprises substantially all of the extracellular
domain of a polypeptide which has at least 85% identity, preferably at least 90% identity,
more preferably at least 95% identity, most preferably at least 97-99% identity, to that
of SEQ ID NO:2, 4, 6 or 8 over the entire length of SEQ ID NO:2.

A fragment is a polypeptide having an amino acid sequence that is entirely the same as part 
but not all of any amino acid sequence of any polypeptide of the invention. As with 
BASB034 polypeptides, fragments may be "free-standing," or comprised within a larger 
polypeptide of which they form a part or region, most preferably as a single continuous 
region in a single larger polypeptide. 

Preferred fragments include, for example, truncation polypeptides having a portion of an 
amino acid sequence of SEQ ID NO:2, 4, 6 or 8 or of variants thereof, such as a continuous 
series of residues that includes an amino- and/or carboxyl-terminal amino acid sequence. 
Degradation forms of the polypeptides of the invention produced by or in a host cell, are 
also preferred. Further preferred are fragments characterized by structural or functional 
attributes such as fragments that comprise alpha-helix and alpha-helix forming regions, 
beta-sheet and beta-sheet-forming regions, turn and turn-forming regions, coil and coil- 
forming regions, hydrophilic regions, hydrophobic regions, alpha amphipathic regions, beta 
amphipathic regions, flexible regions, surface-forming regions, substrate binding region, 
and high antigenic index regions. 

Further preferred fragments include an isolated polypeptide comprising an amino acid 
sequence having at least 15, 20, 30, 40, 50 or 100 contiguous amino acids from the amino 
acid sequence of SEQ ID NO:2, 4, 6 or 8, or an isolated polypeptide comprising an amino 
acid sequence having at least 15, 20, 30, 40, 50 or 100 contiguous amino acids truncated 
or deleted from the amino acid sequence of SEQ ID NO:2, 4, 6 or 8. 

Fragments of the polypeptides of the invention may be employed for producing the
corresponding full-length polypeptide by peptide synthesis; therefore, these fragments
may be employed as intermediates for producing the full-length polypeptides of the
invention.

Particularly preferred are variants in which several, 5-10, 1-5, 1-3, 1-2 or 1 amino acids 
are substituted, deleted, or added in any combination. 

The polypeptides, or immunogenic fragments, of the invention may be in the form of 
the "mature" protein or may be a part of a larger protein such as a precursor or a fusion 
protein. It is often advantageous to include an additional amino acid sequence which 
contains secretory or leader sequences, pro-sequences, sequences which aid in 
purification such as multiple histidine residues, or an additional sequence for stability 
during recombinant production. Furthermore, addition of exogenous polypeptide or 
lipid tail or polynucleotide sequences to increase the immunogenic potential of the final 
molecule is also considered. 

In one aspect, the invention relates to genetically engineered soluble fusion proteins 
comprising a polypeptide of the present invention, or a fragment thereof, and various 
portions of the constant regions of heavy or light chains of immunoglobulins of various 
subclasses (IgG, IgM, IgA, IgE). Preferred as an immunoglobulin is the constant part of 
the heavy chain of human IgG, particularly IgG1, where fusion takes place at the hinge
region. In a particular embodiment, the Fc part can be removed simply by incorporation 
of a cleavage sequence which can be cleaved with blood clotting factor Xa. 

Furthermore, this invention relates to processes for the preparation of these fusion 
proteins by genetic engineering, and to the use thereof for drug screening, diagnosis and 
therapy. A further aspect of the invention also relates to polynucleotides encoding such 
fusion proteins. Examples of fusion protein technology can be found in International 
Patent Application Nos. WO94/29458 and WO94/22914.

The proteins may be chemically conjugated, or expressed as recombinant fusion
proteins allowing increased levels to be produced in an expression system as compared
to non-fused protein. The fusion partner may assist in providing T helper epitopes
(immunological fusion partner), preferably T helper epitopes recognised by humans, or
assist in expressing the protein (expression enhancer) at higher yields than the native
recombinant protein. Preferably the fusion partner will be both an immunological
fusion partner and expression enhancing partner.

Fusion partners include protein D from Haemophilus influenzae and the non-structural
protein from influenza virus, NS1 (hemagglutinin). Another fusion partner is the
protein known as LytA. Preferably the C terminal portion of the molecule is used. LytA
is derived from Streptococcus pneumoniae, which synthesizes an N-acetyl-L-alanine
amidase, amidase LytA (coded by the lytA gene {Gene, 43 (1986) page 265-272}), an
autolysin that specifically degrades certain bonds in the peptidoglycan backbone. The
C-terminal domain of the LytA protein is responsible for the affinity to choline or to
some choline analogues such as DEAE. This property has been exploited for the
development of E. coli C-LytA expressing plasmids useful for expression of fusion
proteins. Purification of hybrid proteins containing the C-LytA fragment at their amino
terminus has been described {Biotechnology: 10, (1992) page 795-798}. It is possible
to use the repeat portion of the LytA molecule found in the C terminal end starting at
residue 178, for example residues 188 - 305.

The present invention also includes variants of the aforementioned polypeptides, that is
polypeptides that vary from the referents by conservative amino acid substitutions,
whereby a residue is substituted by another with like characteristics. Typical such
substitutions are among Ala, Val, Leu and Ile; among Ser and Thr; among the acidic
residues Asp and Glu; among Asn and Gln; and among the basic residues Lys and Arg; or
aromatic residues Phe and Tyr.
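
As a non-limiting illustration, the substitution classes named above can be encoded as sets of one-letter amino acid codes and used to label each difference between a reference and a variant as conservative or not. The Python sketch below assumes two sequences of equal length (substitutions only, no insertions or deletions); the function names and the five-residue test peptides are hypothetical.

```python
# Substitution classes from the text, in one-letter code:
# Ala/Val/Leu/Ile, Ser/Thr, Asp/Glu, Asn/Gln, Lys/Arg, Phe/Tyr.
CONSERVATIVE_GROUPS = [
    {"A", "V", "L", "I"},
    {"S", "T"},
    {"D", "E"},
    {"N", "Q"},
    {"K", "R"},
    {"F", "Y"},
]


def is_conservative(original, replacement):
    """True if both residues differ and fall within the same class."""
    if original == replacement:
        return False  # identical residues are not a substitution
    return any(original in group and replacement in group
               for group in CONSERVATIVE_GROUPS)


def classify_substitutions(reference, variant):
    """List (position, original, replacement, conservative?) for each change.

    Positions are 1-based. Sequences must have equal length because this
    sketch handles substitutions only.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must be the same length")
    changes = []
    for pos, (ref_aa, var_aa) in enumerate(zip(reference, variant), start=1):
        if ref_aa != var_aa:
            changes.append((pos, ref_aa, var_aa,
                            is_conservative(ref_aa, var_aa)))
    return changes


# Hypothetical peptides: A1V and K3R are conservative, F5W is not.
for change in classify_substitutions("ALKDF", "VLRDW"):
    print(change)
```

A variant would fall within the class described above only if every reported change is flagged as conservative.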

Polypeptides of the present invention can be prepared in any suitable manner. Such
polypeptides include isolated naturally occurring polypeptides, recombinantly produced
polypeptides, synthetically produced polypeptides, or polypeptides produced by a
combination of these methods. Means for preparing such polypeptides are well
understood in the art.

It is most preferred that a polypeptide of the invention is derived from Moraxella
catarrhalis; however, it may preferably be obtained from other organisms of the same
taxonomic genus. A polypeptide of the invention may also be obtained, for example, from
organisms of the same taxonomic family or order.

Polynucleotides 

It is an object of the invention to provide polynucleotides that encode BASB034 
polypeptides, particularly polynucleotides that encode the polypeptide herein designated 
BASB034.

In a particularly preferred embodiment of the invention the polynucleotide comprises a 
region encoding BASB034 polypeptides comprising a sequence set out in SEQ ID NO:1, 3,
5 or 7 which includes a full length gene, or a variant thereof. 

The BASB034 polynucleotides provided in SEQ ID NO:1, 3, 5 or 7 are the BASB034
polynucleotides from Moraxella catarrhalis strains Mc2931 (ATCC 43617), Mc2908,
Mc2913 and Mc2969.

As a further aspect of the invention there are provided isolated nucleic acid molecules
encoding and/or expressing BASB034 polypeptides and polynucleotides, particularly
Moraxella catarrhalis BASB034 polypeptides and polynucleotides, including, for
example, unprocessed RNAs, ribozyme RNAs, mRNAs, cDNAs, genomic DNAs, B-
and Z-DNAs. Further embodiments of the invention include biologically,
diagnostically, prophylactically, clinically or therapeutically useful polynucleotides and
polypeptides, and variants thereof, and compositions comprising the same.

Another aspect of the invention relates to isolated polynucleotides, including at least one full
length gene, that encode a BASB034 polypeptide having a deduced amino acid sequence of
SEQ ID NO:2, 4, 6 or 8 and polynucleotides closely related thereto and variants thereof.

In another particularly preferred embodiment of the invention there is a BASB034
polypeptide from Moraxella catarrhalis comprising or consisting of an amino acid
sequence of SEQ ID NO:2, 4, 6 or 8 or a variant thereof.

Using the information provided herein, such as a polynucleotide sequence set out in SEQ ID
NO:1, 3, 5 or 7, a polynucleotide of the invention encoding BASB034 polypeptide may be
obtained using standard cloning and screening methods, such as those for cloning and
sequencing chromosomal DNA fragments from bacteria using Moraxella catarrhalis Catlin
cells as starting material, followed by obtaining a full length clone. For example, to obtain a
polynucleotide sequence of the invention, such as a polynucleotide sequence given in
SEQ ID NO:1, 3, 5 or 7, typically a library of clones of chromosomal DNA of Moraxella
catarrhalis Catlin in E. coli or some other suitable host is probed with a radiolabeled
oligonucleotide, preferably a 17-mer or longer, derived from a partial sequence. Clones
carrying DNA identical to that of the probe can then be distinguished using stringent
hybridization conditions. By sequencing the individual clones thus identified by
hybridization with sequencing primers designed from the original polypeptide or
polynucleotide sequence it is then possible to extend the polynucleotide sequence in both
directions to determine a full length gene sequence. Conveniently, such sequencing is
performed, for example, using denatured double stranded DNA prepared from a plasmid
clone. Suitable techniques are described by Maniatis, T., Fritsch, E.F. and Sambrook et
al., MOLECULAR CLONING, A LABORATORY MANUAL, 2nd Ed.; Cold Spring Harbor
Laboratory Press, Cold Spring Harbor, New York (1989) (see in particular Screening By
Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Templates 13.70).
Direct genomic DNA sequencing may also be performed to obtain a full length gene
sequence. Illustrative of the invention, each polynucleotide set out in SEQ ID NO:1, 3, 5 or
7 was discovered in a DNA library derived from Moraxella catarrhalis.

Moreover, each DNA sequence set out in SEQ ID NO:1, 3, 5 or 7 contains an open reading
frame encoding a protein having about the number of amino acid residues set forth in SEQ 
ID NO:2, 4, 6 or 8 with a deduced molecular weight that can be calculated using amino acid 
residue molecular weight values well known to those skilled in the art. 
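
For illustration, a deduced molecular weight of this kind can be estimated by summing average residue masses and adding one water molecule for the free termini. The Python sketch below uses commonly tabulated average residue masses in daltons, quoted here as approximate values; the function name and the test peptides are hypothetical, and signal peptide cleavage and other post-translational processing are ignored.

```python
# Approximate average masses (Da) of amino acid residues, i.e. the amino
# acid minus one water, as commonly tabulated.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167,
    "V": 99.1326, "T": 101.1051, "C": 103.1388, "L": 113.1594,
    "I": 113.1594, "N": 114.1038, "D": 115.0886, "Q": 128.1307,
    "K": 128.1741, "E": 129.1155, "M": 131.1926, "H": 137.1411,
    "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.0153  # added once for the N-terminal H and C-terminal OH


def deduced_molecular_weight(protein):
    """Estimate the average molecular weight (Da) of an unmodified chain."""
    protein = protein.upper()
    if not protein:
        raise ValueError("sequence must not be empty")
    unknown = set(protein) - set(RESIDUE_MASS)
    if unknown:
        raise ValueError(f"unrecognised residue codes: {sorted(unknown)}")
    return sum(RESIDUE_MASS[aa] for aa in protein) + WATER_MASS


# Sanity checks against small, well-known molecules.
print(round(deduced_molecular_weight("G"), 2))   # glycine, about 75.07
print(round(deduced_molecular_weight("GG"), 2))  # glycylglycine, about 132.12
```

Applied to a full-length deduced sequence, the same sum gives the deduced molecular weight in daltons; dividing by 1000 expresses it in kDa.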

The polynucleotide of SEQ ID NO:1, between the start codon at nucleotide number 1 and
the stop codon which begins at nucleotide number 1327 of SEQ ID NO:1, encodes the
polypeptide of SEQ ID NO:2. 

The polynucleotide of SEQ ID NO:3, between the start codon at nucleotide number 1 and 
the stop codon which begins at nucleotide number 1327 of SEQ ID NO:3, encodes the 
polypeptide of SEQ ID NO:4. 

The polynucleotide of SEQ ID NO:5, between the start codon at nucleotide number 1 and 
the stop codon which begins at nucleotide number 1327 of SEQ ID NO:5, encodes the 
polypeptide of SEQ ID NO:6. 

The polynucleotide of SEQ ID NO:7, between the start codon at nucleotide number 1 and 
the stop codon which begins at nucleotide number 1327 of SEQ ID NO:7, encodes the 
polypeptide of SEQ ID NO:8. 
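
The coordinates recited above imply a coding region whose extent follows from simple arithmetic. The Python sketch below, offered as an illustration only, assumes 1-based nucleotide numbering, that the start codon is translated, and that the stop codon is not; the function name and the returned field names are hypothetical.

```python
def coding_region(start_codon_pos, stop_codon_pos):
    """Derive the coding span from 1-based start and stop codon positions.

    start_codon_pos: first nucleotide of the start codon.
    stop_codon_pos:  first nucleotide of the stop codon.
    """
    coding_nt = stop_codon_pos - start_codon_pos
    if coding_nt <= 0:
        raise ValueError("stop codon must lie downstream of the start codon")
    if coding_nt % 3 != 0:
        raise ValueError("start and stop codons are not in the same frame")
    return {
        "first_nt": start_codon_pos,
        "last_nt": stop_codon_pos - 1,  # last coding nucleotide
        "codons": coding_nt // 3,       # sense codons, stop excluded
    }


# Start codon at nucleotide 1, stop codon beginning at nucleotide 1327.
print(coding_region(1, 1327))
# {'first_nt': 1, 'last_nt': 1326, 'codons': 442}
```

Under these assumptions each open reading frame spans nucleotides 1 to 1326 and corresponds to 442 sense codons; the Sequence Listing remains the authoritative statement of the encoded sequences.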

In a further aspect, the present invention provides for an isolated polynucleotide 
comprising or consisting of: 

(a) a polynucleotide sequence which has at least 85% identity, preferably at least 90%
identity, more preferably at least 95% identity, even more preferably at least 97-99% or
exact identity to SEQ ID NO:1, 3, 5 or 7 over the entire length of SEQ ID NO:1, 3, 5 or 7
respectively; or

(b) a polynucleotide sequence encoding a polypeptide which has at least 85% identity,
preferably at least 90% identity, more preferably at least 95% identity, even more
preferably at least 97-99% or 100% exact identity, to the amino acid sequence of SEQ ID NO:2,
4, 6 or 8, over the entire length of SEQ ID NO:2, 4, 6 or 8 respectively.

A polynucleotide encoding a polypeptide of the present invention, including homologs and 
orthologs from species other than Moraxella catarrhalis, may be obtained by a process 
which comprises the steps of screening an appropriate library under stringent hybridization 
conditions (for example, using a temperature in the range of 45 - 65°C and an SDS 
concentration from 0.1 - 1%) with a labeled or detectable probe consisting of or comprising 
the sequence of SEQ ID NO:1, 3, 5 or 7 or a fragment thereof; and isolating a full-length
gene and/or genomic clones containing said polynucleotide sequence. 

The invention provides a polynucleotide sequence identical over its entire length to a coding
sequence (open reading frame) in SEQ ID NO:1, 3, 5 or 7. Also provided by the invention
is a coding sequence for a mature polypeptide or a fragment thereof, by itself as well as a
coding sequence for a mature polypeptide or a fragment in reading frame with another
coding sequence, such as a sequence encoding a leader or secretory sequence, a pre-, or pro-
or prepro-protein sequence. The polynucleotide of the invention may also contain at least
one non-coding sequence, including for example, but not limited to at least one non-coding
5' and 3' sequence, such as the transcribed but non-translated sequences, termination signals
(such as rho-dependent and rho-independent termination signals), ribosome binding sites,
Kozak sequences, sequences that stabilize mRNA, introns, and polyadenylation signals.
The polynucleotide sequence may also comprise additional coding sequence encoding
additional amino acids. For example, a marker sequence that facilitates purification of the
fused polypeptide can be encoded. In certain embodiments of the invention, the marker
sequence is a hexa-histidine peptide, as provided in the pQE vector (Qiagen, Inc.) and
described in Gentz et al., Proc. Natl. Acad. Sci. USA 86: 821-824 (1989), or an HA peptide
tag (Wilson et al., Cell 37: 767 (1984)), both of which may be useful in purifying
polypeptide sequence fused to them. Polynucleotides of the invention also include, but are
not limited to, polynucleotides comprising a structural gene and its naturally associated
sequences that control gene expression.

The nucleotide sequence encoding BASB034 polypeptide of SEQ ID NO:2, 4, 6 or 8 may
be identical to the polypeptide encoding sequence contained in nucleotides 1 to 1326 of
SEQ ID NO:1, 3, 5 or 7 respectively. Alternatively it may be a sequence which, as a
result of the redundancy (degeneracy) of the genetic code, also encodes the polypeptide of
SEQ ID NO:2, 4, 6 or 8.
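
The redundancy of the genetic code referred to above can be illustrated with a short Python sketch showing that two different nucleotide sequences may encode the same polypeptide. The sketch assumes the standard codon assignments (which also apply to sense codons in bacteria); the function names and the 12-nucleotide test sequences are hypothetical and are not drawn from the Sequence Listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate a coding sequence in frame 1, stopping at the first stop."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


def synonymous_codons(amino_acid):
    """All codons encoding the given one-letter amino acid."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


# Two hypothetical sequences that differ at three positions yet encode
# the same tripeptide Met-Leu-Lys.
seq_a = "ATGCTGAAATAA"
seq_b = "ATGTTAAAGTGA"
print(translate(seq_a), translate(seq_b))  # MLK MLK
print(synonymous_codons("L"))              # six leucine codons
```

Any sequence obtained by exchanging codons for synonymous ones in this way encodes the same polypeptide while differing at the nucleotide level.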

The term "polynucleotide encoding a polypeptide" as used herein encompasses 
polynucleotides that include a sequence encoding a polypeptide of the invention, 
particularly a bacterial polypeptide and more particularly a polypeptide of the Moraxella 
catarrhalis BASB034 having an amino acid sequence set out in SEQ ID NO:2, 4, 6 or 8. 
The term also encompasses polynucleotides that include a single continuous region or 
discontinuous regions encoding the polypeptide (for example, polynucleotides interrupted 
by integrated phage, an integrated insertion sequence, an integrated vector sequence, an 
integrated transposon sequence, or due to RNA editing or genomic DNA reorganization)
together with additional regions, that also may contain coding and/or non-coding sequences. 

The invention further relates to variants of the polynucleotides described herein that encode 
variants of a polypeptide having a deduced amino acid sequence of SEQ ID NO:2, 4, 6 or 8. 
Fragments of polynucleotides of the invention may be used, for example, to synthesize full- 
length polynucleotides of the invention. 

Further particularly preferred embodiments are polynucleotides encoding BASB034
variants, that have the amino acid sequence of BASB034 polypeptide of SEQ ID NO:2, 4, 6
or 8 in which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues are
substituted, modified, deleted and/or added, in any combination. Especially preferred
among these are silent substitutions, additions and deletions, that do not alter the properties
and activities of BASB034 polypeptide.

Further preferred embodiments of the invention are polynucleotides that are at least 85% 
identical over their entire length to a polynucleotide encoding BASB034 polypeptide having
an amino acid sequence set out in SEQ ID NO:2, 4, 6 or 8, and polynucleotides that are
complementary to such polynucleotides. Alternatively, most highly preferred are
polynucleotides that comprise a region that is at least 90% identical over its entire length to
a polynucleotide encoding BASB034 polypeptide and polynucleotides complementary
thereto. In this regard, polynucleotides at least 95% identical over their entire length to the 
same are particularly preferred. Furthermore, those with at least 97% are highly preferred 
among those with at least 95%, and among these those with at least 98% and at least 99% 
are particularly highly preferred, with at least 99% being the more preferred. 

Preferred embodiments are polynucleotides encoding polypeptides that retain substantially 
the same biological function or activity as the mature polypeptide encoded by a DNA of 
SEQ ID NO:1, 3, 5 or 7.

In accordance with certain preferred embodiments of this invention there are provided 
polynucleotides that hybridize, particularly under stringent conditions, to BASB034 
polynucleotide sequences, such as those polynucleotides in SEQ ID NO:1, 3, 5 or 7.

The invention further relates to polynucleotides that hybridize to the polynucleotide 
sequences provided herein. In this regard, the invention especially relates to 
polynucleotides that hybridize under stringent conditions to the polynucleotides described 
herein. As herein used, the terms "stringent conditions" and "stringent hybridization 
conditions" mean hybridization occurring only if there is at least 95% and preferably at least 
97% identity between the sequences. A specific example of stringent hybridization 
conditions is overnight incubation at 42°C in a solution comprising: 50% formamide, 5x
SSC (150mM NaCl, 15mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x
Denhardt's solution, 10% dextran sulfate, and 20 micrograms/ml of denatured, sheared
salmon sperm DNA, followed by washing the hybridization support in 0.1x SSC at about
65°C. Hybridization and wash conditions are well known and exemplified in Sambrook,
et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor,
N.Y., (1989), particularly Chapter 11 therein. Solution hybridization may also be used
with the polynucleotide sequences provided by the invention. 
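
As an illustrative aid, the salt concentrations of the hybridization and wash buffers above can be derived from the SSC strength. The Python sketch below assumes that the parenthetical composition (150 mM NaCl, 15 mM trisodium citrate) defines 1x SSC, which is the conventional definition, and that other strengths scale linearly; the names used are hypothetical.

```python
# Assumed composition of 1x SSC in millimolar (conventional definition).
SSC_1X_MM = {"NaCl": 150.0, "trisodium citrate": 15.0}


def ssc_concentrations(fold):
    """Return solute concentrations (mM) for an SSC buffer of given strength."""
    if fold <= 0:
        raise ValueError("SSC strength must be positive")
    return {solute: fold * mm for solute, mm in SSC_1X_MM.items()}


hybridization = ssc_concentrations(5)   # 5x SSC used during hybridization
wash = ssc_concentrations(0.1)          # 0.1x SSC used for the wash

for name, buffer in (("hybridization", hybridization), ("wash", wash)):
    summary = ", ".join(f"{solute} {mm:.1f} mM" for solute, mm in buffer.items())
    print(f"{name}: {summary}")
```

On this reading the wash is carried out at one fiftieth of the hybridization salt concentration and at a higher temperature, both of which raise stringency by destabilizing imperfectly matched duplexes.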

The invention also provides a polynucleotide consisting of or comprising a polynucleotide 
sequence obtained by screening an appropriate library containing the complete gene for a 
polynucleotide sequence set forth in SEQ ID NO:1, 3, 5 or 7 under stringent hybridization
conditions with a probe having the sequence of said polynucleotide sequence set forth in
SEQ ID NO:1, 3, 5 or 7 or a fragment thereof; and isolating said polynucleotide sequence.
Fragments useful for obtaining such a polynucleotide include, for example, probes and 
primers fully described elsewhere herein. 

As discussed elsewhere herein regarding polynucleotide assays of the invention, for 
instance, the polynucleotides of the invention may be used as a hybridization probe for
RNA, cDNA and genomic DNA to isolate full-length cDNAs and genomic clones encoding 
BASB034 and to isolate cDNA and genomic clones of other genes that have a high identity, 
particularly high sequence identity, to the BASB034 gene. Such probes generally will 
comprise at least 15 nucleotide residues or base pairs. Preferably, such probes will have at 
least 30 nucleotide residues or base pairs and may have at least 50 nucleotide residues or 
base pairs. Particularly preferred probes will have at least 20 nucleotide residues or base 
pairs and will have less than 30 nucleotide residues or base pairs. 

A coding region of a BASB034 gene may be isolated by screening using a DNA sequence
provided in SEQ ID NO:1, 3, 5 or 7 to synthesize an oligonucleotide probe. A labeled
oligonucleotide having a sequence complementary to that of a gene of the invention is then
used to screen a library of cDNA, genomic DNA or mRNA to determine which members of
the library the probe hybridizes to.
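
By way of example only, candidate probes complementary to a gene sequence can be enumerated by sliding a window of the chosen length along the sequence and taking the reverse complement of each window. The Python sketch below enforces the 15-nucleotide minimum mentioned in this description; the function names, the default length of 20 and the 21-nucleotide test sequence are hypothetical, and no account is taken of melting temperature, secondary structure or probe specificity.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def candidate_probes(target, length=20, min_length=15):
    """Enumerate probes complementary to every window of the target.

    Returns (start, probe) pairs, where start is the 1-based position of
    the window on the target strand and probe is written 5' to 3'.
    """
    if length < min_length:
        raise ValueError(f"probes should have at least {min_length} residues")
    target = target.upper()
    return [(i + 1, reverse_complement(target[i:i + length]))
            for i in range(len(target) - length + 1)]


# Hypothetical 21-nucleotide target: two overlapping 20-mer windows.
target = "ATGCATGCATGCATGCATGCA"
for start, probe in candidate_probes(target, length=20):
    print(start, probe)
```

In practice the candidate list would be filtered further, for example by composition or uniqueness within the genome, before a probe is synthesized and labeled.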

There are several methods available and well known to those skilled in the art to obtain 
full-length DNAs, or extend short DNAs, for example those based on the method of 
Rapid Amplification of cDNA ends (RACE) (see, for example, Frohman, et al., PNAS 
USA 85: 8998-9002, 1988). Recent modifications of the technique, exemplified by the 
Marathon™ technology (Clontech Laboratories Inc.) for example, have significantly 
simplified the search for longer cDNAs. In the Marathon™ technology, cDNAs have 
been prepared from mRNA extracted from a chosen tissue and an 'adaptor' sequence 
ligated onto each end. Nucleic acid amplification (PCR) is then carried out to amplify the 
"missing" 5' end of the DNA using a combination of gene specific and adaptor specific 
oligonucleotide primers. The PCR reaction is then repeated using "nested" primers, that 
is, primers designed to anneal within the amplified product (typically an adaptor specific 
primer that anneals further 3' in the adaptor sequence and a gene specific primer that 
anneals further 5' in the selected gene sequence). The products of this reaction can then 
be analyzed by DNA sequencing and a full-length DNA constructed either by joining the 
product directly to the existing DNA to give a complete sequence, or carrying out a 
separate full-length PCR using the new sequence information for the design of the 5' 
primer. 
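
The relationship between the first-round and "nested" primers can be sketched as follows. This is a minimal illustration in Python, assuming exact primer matches and a single-stranded template written 5' to 3'. The helper names (`amplicon`, `reverse_complement`) and all sequences used with it are hypothetical and do not reproduce any commercial adaptor or any sequence of the invention.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def amplicon(template: str, forward: str, reverse: str):
    """Return the region of `template` amplified by a primer pair, or None.

    `forward` is identical to the template strand at its binding site;
    `reverse` is complementary to it, so its site is found by searching
    for the reverse complement of the primer downstream of `forward`.
    """
    start = template.find(forward)
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(reverse_site)]
```

For a toy template made of an adaptor, an unknown 5' region and a known gene segment, the first round would use an adaptor-specific forward primer and a gene-specific reverse primer; the nested round would use an adaptor primer lying further 3' in the adaptor and a gene-specific primer lying further 5' in the known sequence, so that the nested product falls inside the first-round product while still spanning the unknown region.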

The polynucleotides and polypeptides of the invention may be employed, for example, as 
research reagents and materials for discovery of treatments of and diagnostics for diseases, 
particularly human diseases, as further discussed herein relating to polynucleotide assays. 

The polynucleotides of the invention that are oligonucleotides derived from a sequence of 
SEQ ID NOS:1 - 8 may be used in the processes herein as described, but preferably for 
PCR, to determine whether or not the polynucleotides identified herein in whole or in part 
are transcribed in bacteria in infected tissue. It is recognized that such sequences will also 
have utility in diagnosis of the stage of infection and type of infection the pathogen has 
attained. 

The invention also provides polynucleotides that encode a polypeptide that is the mature 
protein plus additional amino or carboxyl-terminal amino acids, or amino acids interior to 
the mature polypeptide (when the mature form has more than one polypeptide chain, for 
instance). Such sequences may play a role in processing of a protein from precursor to a 
mature form, may allow protein transport, may lengthen or shorten protein half-life or may 
facilitate manipulation of a protein for assay or production, among other things. As 
generally is the case in vivo, the additional amino acids may be processed away from the 
mature protein by cellular enzymes. 

For each and every polynucleotide of the invention there is provided a polynucleotide 
complementary to it. It is preferred that these complementary polynucleotides are fully 
complementary to each polynucleotide with which they are complementary. 

A precursor protein, having a mature form of the polypeptide fused to one or more 
prosequences may be an inactive form of the polypeptide. When prosequences are removed 
such inactive precursors generally are activated. Some or all of the prosequences may be 
removed before activation. Generally, such precursors are called proproteins. 

In addition to the standard A, G, C, T/U representations for nucleotides, the term "N" may 
also be used in describing certain polynucleotides of the invention. "N" means that any of 
the four DNA or RNA nucleotides may appear at such a designated position in the DNA 
or RNA sequence, except it is preferred that N is not a nucleic acid that when taken in 
combination with adjacent nucleotide positions, when read in the correct reading frame, 
would have the effect of generating a premature termination codon in such reading frame. 

In sum, a polynucleotide of the invention may encode a mature protein, a mature protein 
plus a leader sequence (which may be referred to as a preprotein), a precursor of a mature 
protein having one or more prosequences that are not the leader sequences of a preprotein, 
or a preproprotein, which is a precursor to a proprotein, having a leader sequence and one or 
more prosequences, which generally are removed during processing steps that produce 
active and mature forms of the polypeptide. 
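
The preference stated earlier for "N" positions, namely that N should not create a premature termination codon in the correct reading frame, can be checked mechanically. The Python sketch below is illustrative only. It assumes the standard stop codons TAA, TAG and TGA, a caller-supplied reading frame, and at most one N per codon; codons with several N positions or incomplete codons are not evaluated and are reported as unrestricted. The function name and example sequences are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def allowed_bases_at_n(seq: str, frame: int = 0) -> dict[int, list[str]]:
    """Map each N position to the bases that avoid an in-frame stop codon.

    `frame` is the index (0, 1 or 2) of the first base of the first codon.
    """
    seq = seq.upper().replace("U", "T")  # treat RNA input as DNA
    allowed = {}
    for pos, base in enumerate(seq):
        if base != "N":
            continue
        codon_start = pos - ((pos - frame) % 3)
        codon = seq[codon_start:codon_start + 3] if codon_start >= 0 else ""
        if len(codon) < 3 or codon.count("N") != 1:
            # Outside the frame, truncated, or multi-N codon: not evaluated.
            allowed[pos] = list("ACGT")
            continue
        offset = pos - codon_start
        allowed[pos] = [
            b for b in "ACGT"
            if codon[:offset] + b + codon[offset + 1:] not in STOP_CODONS
        ]
    return allowed
```

For example, in the codon TAN only C or T at the N position avoids a stop codon, whereas in NGA only T is excluded.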

In accordance with an aspect of the invention, there is provided the use of a 
polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
genetic immunization. 

The use of a polynucleotide of the invention in genetic immunization will preferably 
employ a suitable delivery method such as direct injection of plasmid DNA into muscles 
(Wolff et al., Hum Mol Genet (1992) 1: 363, Manthorpe et al., Hum. Gene Ther. (1983) 4: 
419), delivery of DNA complexed with specific protein carriers (Wu et al., J Biol Chem. 
(1989) 264: 16985), coprecipitation of DNA with calcium phosphate (Benvenisty & 
Reshef, PNAS USA, (1986) 83: 9551), encapsulation of DNA in various forms of 
liposomes (Kaneda et al., Science (1989) 243: 375), particle bombardment (Tang et al., 
Nature (1992) 356:152, Eisenbraun et al., DNA Cell Biol (1993) 12: 791) and in vivo 
infection using cloned retroviral vectors (Seeger et al., PNAS USA (1984) 81: 5849). 

Vectors, Host Cells, Expression Systems 

The invention also relates to vectors that comprise a polynucleotide or polynucleotides of 
the invention, host cells that are genetically engineered with vectors of the invention and the 
production of polypeptides of the invention by recombinant techniques. Cell-free 
translation systems can also be employed to produce such proteins using RNAs derived 
from the DNA constructs of the invention. 

Recombinant polypeptides of the present invention may be prepared by processes well 
known to those skilled in the art from genetically engineered host cells comprising 
expression systems. Accordingly, in a further aspect, the present invention relates to 
expression systems that comprise a polynucleotide or polynucleotides of the present 
invention, to host cells which are genetically engineered with such expression systems, and 
to the production of polypeptides of the invention by recombinant techniques. 

For recombinant production of the polypeptides of the invention, host cells can be 
genetically engineered to incorporate expression systems or portions thereof or 
polynucleotides of the invention. Introduction of a polynucleotide into the host cell can be 
effected by methods described in many standard laboratory manuals, such as Davis, et al., 
BASIC METHODS IN MOLECULAR BIOLOGY, (1986) and Sambrook, et al., 
MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y. (1989), such as, calcium phosphate 
transfection, DEAE-dextran mediated transfection, transvection, microinjection, cationic 
lipid-mediated transfection, electroporation, transduction, scrape loading, ballistic 
introduction and infection. 

Representative examples of appropriate hosts include bacterial cells, such as cells of 
streptococci, staphylococci, enterococci, E. coli, streptomyces, cyanobacteria, Bacillus 
subtilis, Neisseria meningitidis and Moraxella catarrhalis; fungal cells, such as cells of a 
yeast, Kluyveromyces, Saccharomyces, a basidiomycete, Candida albicans and Aspergillus; 
insect cells such as cells of Drosophila S2 and Spodoptera Sf9; animal cells such as CHO, 
COS, HeLa, C127, 3T3, BHK, 293, CV-1 and Bowes melanoma cells; and plant cells, such 
as cells of a gymnosperm or angiosperm. 

A great variety of expression systems can be used to produce the polypeptides of the 
invention. Such vectors include, among others, chromosomal-, episomal- and virus-derived 
vectors, for example, vectors derived from bacterial plasmids, from bacteriophage, from 
transposons, from yeast episomes, from insertion elements, from yeast chromosomal 
elements, from viruses such as baculoviruses, papova viruses, such as SV40, vaccinia 
viruses, adenoviruses, fowl pox viruses, pseudorabies viruses, picornaviruses, retroviruses, 
and alphaviruses and vectors derived from combinations thereof, such as those derived from 
plasmid and bacteriophage genetic elements, such as cosmids and phagemids. The 
expression system constructs may contain control regions that regulate as well as engender 
expression. Generally, any system or vector suitable to maintain, propagate or express 
polynucleotides and/or to express a polypeptide in a host may be used for expression in this 
regard. The appropriate DNA sequence may be inserted into the expression system by any 
of a variety of well-known and routine techniques, such as, for example, those set forth in 
Sambrook et al., MOLECULAR CLONING, A LABORATORY MANUAL, (supra). 

In recombinant expression systems in eukaryotes, for secretion of a translated protein into 
the lumen of the endoplasmic reticulum, into the periplasmic space or into the extracellular 
environment, appropriate secretion signals may be incorporated into the expressed 
polypeptide. These signals may be endogenous to the polypeptide or they may be 
heterologous signals. 

Polypeptides of the present invention can be recovered and purified from recombinant 
cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity chromatography, 
hydroxylapatite chromatography and lectin chromatography. Most preferably, immobilized 
metal affinity chromatography (IMAC) is employed for purification. Well known techniques 
for refolding proteins may be employed to regenerate active conformation when the 
polypeptide is denatured during intracellular synthesis, isolation and/or purification. 

The expression system may also be a recombinant live microorganism, such as a virus 
or bacterium. The gene of interest can be inserted into the genome of a live recombinant 
virus or bacterium. Inoculation and in vivo infection with this live vector will lead to in 
vivo expression of the antigen and induction of immune responses. Viruses and bacteria 
used for this purpose are for instance: poxviruses (e.g., vaccinia, fowlpox, canarypox), 
alphaviruses (Sindbis virus, Semliki Forest Virus, Venezuelan Equine Encephalitis 
Virus), adenoviruses, adeno-associated virus, picornaviruses (poliovirus, rhinovirus), 
herpesviruses (varicella zoster virus, etc), Listeria, Salmonella, Shigella, BCG. These 
viruses and bacteria can be virulent, or attenuated in various ways in order to obtain live 
vaccines. Such live vaccines also form part of the invention. 



Diagnostic, Prognostic, Serotyping and Mutation Assays 

This invention is also related to the use of BASB034 polynucleotides and polypeptides of 
the invention for use as diagnostic reagents. Detection of BASB034 polynucleotides and/or 
polypeptides in a eukaryote, particularly a mammal, and especially a human, will provide a 
diagnostic method for diagnosis of disease, staging of disease or response of an infectious 
organism to drugs. Eukaryotes, particularly mammals, and especially humans, particularly 
those infected or suspected to be infected with an organism comprising the BASB034 gene 
or protein, may be detected at the nucleic acid or amino acid level by a variety of well 
known techniques as well as by methods provided herein. 

Polypeptides and polynucleotides for prognosis, diagnosis or other analysis may be obtained 
from a putatively infected and/or infected individual's bodily materials. Polynucleotides 
from any of these sources, particularly DNA or RNA, may be used directly for detection or 
may be amplified enzymatically by using PCR or any other amplification technique prior to 
analysis. RNA, particularly mRNA, cDNA and genomic DNA may also be used in the 
same ways. Using amplification, characterization of the species and strain of infectious or 
resident organism present in an individual, may be made by an analysis of the genotype of a 
selected polynucleotide of the organism. Deletions and insertions can be detected by a 
change in size of the amplified product in comparison to a genotype of a reference sequence 
selected from a related organism, preferably a different species of the same genus or a 
different strain of the same species. Point mutations can be identified by hybridizing 
amplified DNA to labeled BASB034 polynucleotide sequences. Perfectly or significantly 
matched sequences can be distinguished from imperfectly or more significantly mismatched 
duplexes by DNase or RNase digestion, for DNA or RNA respectively, or by detecting 
differences in melting temperatures or renaturation kinetics. Polynucleotide sequence 
differences may also be detected by alterations in the electrophoretic mobility of 
polynucleotide fragments in gels as compared to a reference sequence. This may be carried 
out with or without denaturing agents. Polynucleotide differences may also be detected by 
direct DNA or RNA sequencing. See, for example, Myers et al., Science, 230: 1242 (1985). 
Sequence changes at specific locations also may be revealed by nuclease protection assays, 
such as RNase, V1 and S1 protection assay or a chemical cleavage method. See, for 
example, Cotton et al., Proc. Natl. Acad. Sci., USA, 85: 4397-4401 (1985). 
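
Two of the comparisons described above, detection of insertions or deletions from a change in amplified product size and detection of point differences by direct sequence comparison, can be sketched in Python as follows. This is a simplified illustration under stated assumptions: both sequences are already known, no alignment is performed, and base-by-base comparison is attempted only when the two products have the same length. The function name and return structure are hypothetical.

```python
def compare_amplicons(sample: str, reference: str) -> dict:
    """Compare an amplified sample sequence with a reference sequence.

    Returns a dict with:
      kind         "insertion", "deletion", "point mutation(s)" or "identical"
      size_change  len(sample) - len(reference), in bases
      mismatches   list of (position, reference_base, sample_base); filled
                   in only when the two sequences have equal length
    """
    sample, reference = sample.upper(), reference.upper()
    size_change = len(sample) - len(reference)
    if size_change > 0:
        return {"kind": "insertion", "size_change": size_change, "mismatches": []}
    if size_change < 0:
        return {"kind": "deletion", "size_change": size_change, "mismatches": []}
    mismatches = [
        (i, ref_base, sample_base)
        for i, (ref_base, sample_base) in enumerate(zip(reference, sample))
        if ref_base != sample_base
    ]
    kind = "point mutation(s)" if mismatches else "identical"
    return {"kind": kind, "size_change": 0, "mismatches": mismatches}
```

A size change corresponds to what would appear as a shifted band on a gel, while the mismatch list corresponds to what direct sequencing would reveal.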

In another embodiment, an array of oligonucleotides probes comprising BASB034 
nucleotide sequence or fragments thereof can be constructed to conduct efficient screening 
of, for example, genetic mutations, serotype, taxonomic classification or identification. 
Array technology methods are well known and have general applicability and can be used to 
address a variety of questions in molecular genetics including gene expression, genetic 
linkage, and genetic variability (see, for example, Chee et aL, Science, 274: 610 (1996)). 

Thus in another aspect, the present invention relates to a diagnostic kit which comprises: 

(a) a polynucleotide of the present invention, preferably the nucleotide sequence of SEQ 
ID NO:1, 3, 5 or 7, or a fragment thereof; 

(b) a nucleotide sequence complementary to that of (a); 

(c) a polypeptide of the present invention, preferably the polypeptide of SEQ ID NO:2, 4, 
6 or 8 or a fragment thereof; or 

(d) an antibody to a polypeptide of the present invention, preferably to the polypeptide of 
SEQ ID NO:2, 4, 6 or 8. 

It will be appreciated that in any such kit, (a), (b), (c) or (d) may comprise a substantial 
component. Such a kit will be of use in diagnosing a disease or susceptibility to a 
disease, among others. 

This invention also relates to the use of polynucleotides of the present invention as 
diagnostic reagents. Detection of a mutated form of a polynucleotide of the invention, 
preferably SEQ ID NO:1, 3, 5 or 7, which is associated with a disease or pathogenicity will 
provide a diagnostic tool that can add to, or define, a diagnosis of a disease, a prognosis of a 
course of disease, a determination of a stage of disease, or a susceptibility to a disease, 
which results from under-expression, over-expression or altered expression of the 
polynucleotide. Organisms, particularly infectious organisms, carrying mutations in such 
polynucleotide may be detected at the polynucleotide level by a variety of techniques, such 
as those described elsewhere herein. 

Cells from an organism carrying mutations or polymorphisms (allelic variations) in a 
polynucleotide and/or polypeptide of the invention may also be detected at the 
polynucleotide or polypeptide level by a variety of techniques, to allow for serotyping, for 
example. For example, RT-PCR can be used to detect mutations in the RNA. It is 
particularly preferred to use RT-PCR in conjunction with automated detection systems, such 
as, for example, GeneScan. RNA, cDNA or genomic DNA may also be used for the same 
purpose, PCR. As an example, PCR primers complementary to a polynucleotide encoding 
BASB034 polypeptide can be used to identify and analyze mutations. 

The invention further provides primers with 1, 2, 3 or 4 nucleotides removed from the 5' 
and/or the 3' end. These primers may be used for, among other things, amplifying 
BASB034 DNA and/or RNA isolated from a sample derived from an individual, such as a 
bodily material. The primers may be used to amplify a polynucleotide isolated from an 
infected individual, such that the polynucleotide may then be subject to various techniques 
for elucidation of the polynucleotide sequence. In this way, mutations in the polynucleotide 
sequence may be detected and used to diagnose and/or prognose the infection or its stage or 
course, or to serotype and/or classify the infectious agent. 
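
The primers with 1, 2, 3 or 4 nucleotides removed from the 5' and/or the 3' end, as referred to above, can be enumerated from any starting primer. The Python sketch below is illustrative only; the function name and the example primer are hypothetical and are not derived from a sequence of the invention.

```python
def trimmed_primers(primer: str, max_trim: int = 4) -> list[str]:
    """List primers with up to `max_trim` bases removed from either end.

    Every combination of 0..max_trim bases removed from the 5' end and
    0..max_trim bases removed from the 3' end is generated, excluding the
    untrimmed primer itself. Duplicates (possible with repetitive
    primers) are reported once.
    """
    variants = []
    for five in range(max_trim + 1):
        for three in range(max_trim + 1):
            if five == 0 and three == 0:
                continue  # the full-length primer is not a trimmed variant
            trimmed = primer[five:len(primer) - three]
            if trimmed and trimmed not in variants:
                variants.append(trimmed)
    return variants
```

With the default of four bases, a primer yields at most 24 distinct trimmed variants, each of which could then be evaluated for use in amplification.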

The invention further provides a process for diagnosing disease, preferably bacterial 
infections, more preferably infections caused by Moraxella catarrhalis, comprising 
determining from a sample derived from an individual, such as a bodily material, an 
increased level of expression of polynucleotide having a sequence of SEQ ID NO:1, 3, 5 
or 7. Increased or decreased expression of a BASB034 polynucleotide can be measured 
using any one of the methods well known in the art for the quantitation of polynucleotides, 
such as, for example, amplification, PCR, RT-PCR, RNase protection, Northern blotting, 
spectrometry and other hybridization methods. 

In addition, a diagnostic assay in accordance with the invention for detecting over- 
expression of BASB034 polypeptide compared to normal control tissue samples may be 
used to detect the presence of an infection, for example. Assay techniques that can be used 
to determine levels of a BASB034 polypeptide, in a sample derived from a host, such as a 
bodily material, are well-known to those of skill in the art. Such assay methods include 
radioimmunoassays, competitive-binding assays, Western Blot analysis, antibody sandwich 
assays, antibody detection and ELISA assays. 

The polynucleotides of the invention may be used as components of polynucleotide 
arrays, preferably high density arrays or grids. These high density arrays are 
particularly useful for diagnostic and prognostic purposes. For example, a set of spots 
each comprising a different gene, and further comprising a polynucleotide or 
polynucleotides of the invention, may be used for probing, such as using hybridization 
or nucleic acid amplification, using probes obtained or derived from a bodily sample, 
to determine the presence of a particular polynucleotide sequence or related sequence in 
an individual. Such a presence may indicate the presence of a pathogen, particularly 
Moraxella catarrhalis, and may be useful in diagnosing and/or prognosing disease or a 
course of disease. A grid comprising a number of variants of the polynucleotide 
sequence of SEQ ID NO:1, 3, 5 or 7 is preferred. Also preferred is a grid comprising a 
number of variants of a polynucleotide sequence encoding the polypeptide sequence of 
SEQ ID NO:2, 4, 6 or 8. 

Antibodies 

The polypeptides and polynucleotides of the invention or variants thereof, or cells 
expressing the same can be used as immunogens to produce antibodies immunospecific for 
such polypeptides or polynucleotides respectively. 

In certain preferred embodiments of the invention there are provided antibodies against 
BASB034 polypeptides or polynucleotides. 

Antibodies generated against the polypeptides or polynucleotides of the invention can be 
obtained by administering the polypeptides and/or polynucleotides of the invention, or 
epitope-bearing fragments of either or both, analogues of either or both, or cells expressing 
either or both, to an animal, preferably a nonhuman, using routine protocols. For 
preparation of monoclonal antibodies, any technique known in the art that provides 
antibodies produced by continuous cell line cultures can be used. Examples include various 
techniques, such as those in Kohler, G. and Milstein, C., Nature 256: 495-497 (1975); 
Kozbor et al., Immunology Today 4: 72 (1983); Cole et al., pg. 77-96 in MONOCLONAL 
ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc. (1985). 

Techniques for the production of single chain antibodies (U.S. Patent No. 4,946,778) can be 
adapted to produce single chain antibodies to polypeptides or polynucleotides of this 
invention. Also, transgenic mice, or other organisms or animals, such as other mammals, 
may be used to express humanized antibodies immunospecific to the polypeptides or 
polynucleotides of the invention. 

Alternatively, phage display technology may be utilized to select antibody genes with 
binding activities towards a polypeptide of the invention either from repertoires of PCR 
amplified v-genes of lymphocytes from humans screened for possessing anti-BASB034 or 
from naive libraries (McCafferty, et al., (1990), Nature 348, 552-554; Marks, et al., 
(1992) Biotechnology 10, 779-783). The affinity of these antibodies can also be improved 
by, for example, chain shuffling (Clackson et al., (1991) Nature 352: 628). 

The above-described antibodies may be employed to isolate or to identify clones expressing 
the polypeptides or polynucleotides of the invention, or to purify the polypeptides or 
polynucleotides by, for example, affinity chromatography. 

Thus, among others, antibodies against BASB034-polypeptide or BASB034-polynucleotide 
may be employed to treat infections, particularly bacterial infections. 

Polypeptide variants, including antigenically, epitopically or immunologically equivalent 
variants, form a particular aspect of this invention. 

Preferably, the antibody or variant thereof is modified to make it less immunogenic in the 
individual. For example, if the individual is human the antibody may most preferably be 
"humanized," where the complementarity determining region or regions of the hybridoma- 
derived antibody has been transplanted into a human monoclonal antibody, for example 
as described in Jones et al. (1986), Nature 321, 522-525 or Tempest et al., (1991) 
Biotechnology 9, 266-273. 

Antagonists and Agonists - Assays and Molecules 

Polypeptides and polynucleotides of the invention may also be used to assess the binding of 
small molecule substrates and ligands in, for example, cells, cell-free preparations, chemical 
libraries, and natural product mixtures. These substrates and ligands may be natural 
substrates and ligands or may be structural or functional mimetics. See, e.g., Coligan et al., 
Current Protocols in Immunology 1(2): Chapter 5 (1991). 

The screening methods may simply measure the binding of a candidate compound to the 
polypeptide or polynucleotide, or to cells or membranes bearing the polypeptide or 
polynucleotide, or a fusion protein of the polypeptide by means of a label directly or 
indirectly associated with the candidate compound. Alternatively, the screening method 
may involve competition with a labeled competitor. Further, these screening methods 
may test whether the candidate compound results in a signal generated by activation or 
inhibition of the polypeptide or polynucleotide, using detection systems appropriate to the 
cells comprising the polypeptide or polynucleotide. Inhibitors of activation are generally 
assayed in the presence of a known agonist and the effect on activation by the agonist by 
the presence of the candidate compound is observed. Constitutively active polypeptide 
and/or constitutively expressed polypeptides and polynucleotides may be employed in 
screening methods for inverse agonists or inhibitors, in the absence of an agonist or 
inhibitor, by testing whether the candidate compound results in inhibition of activation of 
the polypeptide or polynucleotide, as the case may be. Further, the screening methods 
may simply comprise the steps of mixing a candidate compound with a solution 
containing a polypeptide or polynucleotide of the present invention, to form a mixture, 
measuring BASB034 polypeptide and/or polynucleotide activity in the mixture, and 
comparing the BASB034 polypeptide and/or polynucleotide activity of the mixture to a 
standard. Fusion proteins, such as those made from Fc portion and BASB034 
polypeptide, as hereinbefore described, can also be used for high-throughput screening 
assays to identify antagonists of the polypeptide of the present invention, as well as of 
phylogenetically and/or functionally related polypeptides (see D. Bennett et al., J Mol 
Recognition, 8:52-58 (1995); and K. Johanson et al., J Biol Chem, 270(16):9459-9471 
(1995)). 



WO 00/15802    PCT/EP99/06781 



The polynucleotides, polypeptides and antibodies that bind to and/or interact with a 
polypeptide of the present invention may also be used to configure screening methods for 
detecting the effect of added compounds on the production of mRNA and/or polypeptide 
in cells. For example, an ELISA assay may be constructed for measuring secreted or cell 
associated levels of polypeptide using monoclonal and polyclonal antibodies by standard 
methods known in the art. This can be used to discover agents which may inhibit or 
enhance the production of polypeptide (also called antagonist or agonist, respectively) 
from suitably manipulated cells or tissues. 

The invention also provides a method of screening compounds to identify those which 
enhance (agonist) or block (antagonist) the action of BASB034 polypeptides or 
polynucleotides, particularly those compounds that are bacteriostatic and/or bactericidal. 
The method of screening may involve high-throughput techniques. For example, to screen 
for agonists or antagonists, a synthetic reaction mix, a cellular compartment, such as a 
membrane, cell envelope or cell wall, or a preparation of any thereof, comprising BASB034 
polypeptide and a labeled substrate or ligand of such polypeptide is incubated in the absence 
or the presence of a candidate molecule that may be a BASB034 agonist or antagonist. The 
ability of the candidate molecule to agonize or antagonize the BASB034 polypeptide is 
reflected in decreased binding of the labeled ligand or decreased production of product from 
such substrate. Molecules that bind gratuitously, i.e., without inducing the effects of 
BASB034 polypeptide are most likely to be good antagonists. Molecules that bind well 
and, as the case may be, increase the rate of product production from substrate, increase 
signal transduction, or increase chemical channel activity are agonists. Detection of the rate 
or level of, as the case may be, production of product from substrate, signal transduction, or 
chemical channel activity may be enhanced by using a reporter system. Reporter systems 
that may be useful in this regard include but are not limited to colorimetric, labeled substrate 
converted into product, a reporter gene that is responsive to changes in BASB034 
polynucleotide or polypeptide activity, and binding assays known in the art. 

Another example of an assay for BASB034 agonists is a competitive assay that combines 
BASB034 and a potential agonist with BASB034-binding molecules, recombinant 
BASB034 binding molecules, natural substrates or ligands, or substrate or ligand mimetics, 
under appropriate conditions for a competitive inhibition assay. BASB034 can be labeled, 
such as by radioactivity or a colorimetric compound, such that the number of BASB034 
molecules bound to a binding molecule or converted to product can be determined 
accurately to assess the effectiveness of the potential antagonist. 
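
The read-out of such a competitive assay is commonly summarized as a percent inhibition of labeled binding. The Python sketch below shows one conventional way of computing it and is offered as an assumption, not as the method of the invention: the signal units (for example, counts per minute), the optional background subtraction and the function name are all hypothetical.

```python
def percent_inhibition(signal_with_candidate: float,
                       signal_without_candidate: float,
                       background: float = 0.0) -> float:
    """Percent reduction in labeled binding caused by a candidate compound.

    `background` is the non-specific signal subtracted from both readings.
    A value near 0 means no competition; a value near 100 means the
    candidate displaced essentially all specifically bound label.
    """
    specific_control = signal_without_candidate - background
    if specific_control <= 0:
        raise ValueError("control signal must exceed background")
    specific_test = signal_with_candidate - background
    return 100.0 * (1.0 - specific_test / specific_control)
```

A candidate that lowers the bound signal from 10000 to 2500 with no background would thus be scored at 75 percent inhibition; negative values indicate that binding increased in the presence of the candidate.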

Potential antagonists include, among others, small organic molecules, peptides, 
polypeptides and antibodies that bind to a polynucleotide and/or polypeptide of the 
invention and thereby inhibit or extinguish its activity or expression. Potential antagonists 
also may be small organic molecules, a peptide, a polypeptide such as a closely related 
protein or antibody that binds the same sites on a binding molecule, such as a binding 
molecule, without inducing BASB034-induced activities, thereby preventing the action or 
expression of BASB034 polypeptides and/or polynucleotides by excluding BASB034 
polypeptides and/or polynucleotides from binding. 

Potential antagonists include a small molecule that binds to and occupies the binding site of 
the polypeptide thereby preventing binding to cellular binding molecules, such that normal 
biological activity is prevented. Examples of small molecules include but are not limited to 
small organic molecules, peptides or peptide-like molecules. Other potential antagonists 
include antisense molecules (see Okano, J. Neurochem. 56: 560 (1991); 
OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESSION, 
CRC Press, Boca Raton, FL (1988), for a description of these molecules). Preferred 
potential antagonists include compounds related to and variants of BASB034. 

In a further aspect, the present invention relates to genetically engineered soluble fusion 
proteins comprising a polypeptide of the present invention, or a fragment thereof, and 
various portions of the constant regions of heavy or light chains of immunoglobulins of 
various subclasses (IgG, IgM, IgA, IgE). Preferred as an immunoglobulin is the constant 
part of the heavy chain of human IgG, particularly IgG1, where fusion takes place at the 
hinge region. In a particular embodiment, the Fc part can be removed simply by 
incorporation of a cleavage sequence which can be cleaved with blood clotting factor Xa. 
Furthermore, this invention relates to processes for the preparation of these fusion 
proteins by genetic engineering, and to the use thereof for drug screening, diagnosis and 
therapy. A further aspect of the invention also relates to polynucleotides encoding such 
fusion proteins. Examples of fusion protein technology can be found in International 
Patent Application Nos. WO94/29458 and WO94/22914. 

Each of the polynucleotide sequences provided herein may be used in the discovery and 
development of antibacterial compounds. The encoded protein, upon expression, can be 
used as a target for the screening of antibacterial drugs. Additionally, the polynucleotide 
sequences encoding the amino terminal regions of the encoded protein or Shine-Dalgarno 
or other translation facilitating sequences of the respective mRNA can be used to 
construct antisense sequences to control the expression of the coding sequence of interest. 
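
As a rough illustration of constructing an antisense sequence against the translation initiation region, the Python sketch below takes a window of the coding strand around the start codon and returns its reverse complement. The window sizes, the function names and any sequence used with it are hypothetical assumptions chosen for illustration; no sequence of the invention is reproduced, and a practical design would also weigh factors such as secondary structure that are ignored here.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def antisense_oligo(coding_strand: str, start_codon_index: int,
                    upstream: int = 8, downstream: int = 10) -> str:
    """Return a DNA oligonucleotide antisense to the initiation region.

    The targeted window runs from `upstream` bases before the start codon
    to `downstream` bases after it, clipped to the sequence ends.
    """
    coding_strand = coding_strand.upper().replace("U", "T")
    begin = max(0, start_codon_index - upstream)
    end = min(len(coding_strand), start_codon_index + 3 + downstream)
    return reverse_complement(coding_strand[begin:end])
```

Choosing `upstream` large enough to reach the ribosome binding site gives an oligonucleotide that covers both the Shine-Dalgarno region and the start codon.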

The invention also provides the use of the polypeptide, polynucleotide, agonist or 
antagonist of the invention to interfere with the initial physical interaction between a 
pathogen or pathogens and a eukaryotic, preferably mammalian, host responsible for 
sequelae of infection. In particular, the molecules of the invention may be used: in the 
prevention of adhesion of bacteria, in particular gram positive and/or gram negative 
bacteria, to eukaryotic, preferably mammalian, extracellular matrix proteins on in- 
dwelling devices or to extracellular matrix proteins in wounds; to block bacterial adhesion 
between eukaryotic, preferably mammalian, extracellular matrix proteins and bacterial 
BASB034 proteins that mediate tissue damage and/or; to block the normal progression of 
pathogenesis in infections initiated other than by the implantation of in-dwelling devices 
or by other surgical techniques. 

In accordance with yet another aspect of the invention, there are provided BASB034 
agonists and antagonists, preferably bacteriostatic or bactericidal agonists and antagonists. 

The antagonists and agonists of the invention may be employed, for instance, to prevent, 
inhibit and/or treat diseases. 

In a further aspect, the present invention relates to mimotopes of the polypeptide of the 
invention. A mimotope is a peptide sequence, sufficiently similar to the native peptide 
(sequentially or structurally), which is capable of being recognised by antibodies which 
recognise the native peptide; or is capable of raising antibodies which recognise the 
native peptide when coupled to a suitable carrier. 

Peptide mimotopes may be designed for a particular purpose by addition, deletion or 
substitution of selected amino acids. Thus, the peptides may be modified for the purposes 
of ease of conjugation to a protein carrier. For example, it may be desirable for some 
chemical conjugation methods to include a terminal cysteine. In addition it may be 
desirable for peptides conjugated to a protein carrier to include a hydrophobic terminus 
distal from the conjugated terminus of the peptide, such that the free unconjugated end 
of the peptide remains associated with the surface of the carrier protein, thereby 
presenting the peptide in a conformation which most closely resembles that of the 
peptide as found in the context of the whole native molecule. For example, the peptides 
may be altered to have an N-terminal cysteine and a C-terminal hydrophobic amidated 
tail. Alternatively, the addition or substitution of a D-stereoisomer form of one or more 
of the amino acids may be performed to create a beneficial derivative, for example to 
enhance stability of the peptide. 

Alternatively, peptide mimotopes may be identified using antibodies which are capable 
themselves of binding to the polypeptides of the present invention using techniques such 
as phage display technology (EP 0 552 267 B1). This technique generates a large number 
of peptide sequences which mimic the structure of the native peptides and are, therefore, 
capable of binding to anti-native peptide antibodies, but may not necessarily themselves 
share significant sequence homology to the native polypeptide. 

Vaccines 

Another aspect of the invention relates to a method for inducing an immunological 
response in an individual, particularly a mammal, preferably humans, which comprises 
inoculating the individual with BASB034 polynucleotide and/or polypeptide, or a 
fragment or variant thereof, adequate to produce antibody and/or T cell immune response 
to protect said individual from infection, particularly bacterial infection and most 
particularly Moraxella catarrhalis infection. Also provided are methods whereby such 
immunological response slows bacterial replication. Yet another aspect of the invention 
relates to a method of inducing immunological response in an individual which comprises 
delivering to such individual a nucleic acid vector, sequence or ribozyme to direct 
expression of BASB034 polynucleotide and/or polypeptide, or a fragment or a variant 
thereof, for expressing BASB034 polynucleotide and/or polypeptide, or a fragment or a 
variant thereof in vivo in order to induce an immunological response, such as, to produce 
antibody and/or T cell immune response, including, for example, cytokine-producing T 
cells or cytotoxic T cells, to protect said individual, preferably a human, from disease, 
whether that disease is already established within the individual or not. One example of 
administering the gene is by accelerating it into the desired cells as a coating on particles 
or otherwise. Such nucleic acid vector may comprise DNA, RNA, a ribozyme, a 
modified nucleic acid, a DNA/RNA hybrid, a DNA-protein complex or an RNA-protein 
complex. 

A further aspect of the invention relates to an immunological composition that when 
introduced into an individual, preferably a human, capable of having induced within it an 
immunological response, induces an immunological response in such individual to a 
BASB034 polynucleotide and/or polypeptide encoded therefrom, wherein the 
composition comprises a recombinant BASB034 polynucleotide and/or polypeptide 
encoded therefrom and/or comprises DNA and/or RNA which encodes and expresses an 
antigen of said BASB034 polynucleotide, polypeptide encoded therefrom, or other 
polypeptide of the invention. The immunological response may be used therapeutically 
or prophylactically and may take the form of antibody immunity and/or cellular 
immunity, such as cellular immunity arising from CTL or CD4+ T cells. 

A BASB034 polypeptide or a fragment thereof may be fused with a co-protein or chemical 
moiety which may or may not by itself produce antibodies, but which is capable of 
stabilizing the first protein and producing a fused or modified protein which will have 
antigenic and/or immunogenic properties, and preferably protective properties. Thus, the 
fused recombinant protein preferably further comprises an antigenic co-protein, such as 
lipoprotein D from Haemophilus influenzae, Glutathione-S-transferase (GST) or 
beta-galactosidase, or any other relatively large co-protein which solubilizes the protein and 
facilitates production and purification thereof. Moreover, the co-protein may act as an 
adjuvant in the sense of providing a generalized stimulation of the immune system of the 
organism receiving the protein. The co-protein may be attached to either the amino- or 
carboxy-terminus of the first protein. 

Provided by this invention are compositions, particularly vaccine compositions, and 
methods comprising the polypeptides and/or polynucleotides of the invention and 
immunostimulatory DNA sequences, such as those described in Sato, Y. et al. Science 
273: 352 (1996). 

Also provided by this invention are methods using the described polynucleotide or 
particular fragments thereof, which have been shown to encode non-variable regions of 
bacterial cell surface proteins, in polynucleotide constructs used in such genetic 
immunization experiments in animal models of infection with Moraxella catarrhalis. 
Such experiments will be particularly useful for identifying protein epitopes able to 
provoke a prophylactic or therapeutic immune response. It is believed that this approach 
will allow for the subsequent preparation of monoclonal antibodies of particular value, 
derived from the requisite organ of the animal successfully resisting or clearing infection, 
for the development of prophylactic agents or therapeutic treatments of bacterial 
infection, particularly Moraxella catarrhalis infection, in mammals, particularly humans. 

The invention also includes a vaccine formulation which comprises an immunogenic 
recombinant polypeptide and/or polynucleotide of the invention together with a suitable 
carrier, such as a pharmaceutically acceptable carrier. Since the polypeptides and 
polynucleotides may be broken down in the stomach, each is preferably administered 
parenterally, including, for example, administration that is subcutaneous, intramuscular, 
intravenous, or intradermal. Formulations suitable for parenteral administration include 
aqueous and non-aqueous sterile injection solutions which may contain anti-oxidants, 
buffers, bacteriostatic compounds and solutes which render the formulation isotonic with 
the bodily fluid, preferably the blood, of the individual; and aqueous and non-aqueous 
sterile suspensions which may include suspending agents or thickening agents. The 
formulations may be presented in unit-dose or multi-dose containers, for example, sealed 
ampoules and vials and may be stored in a freeze-dried condition requiring only the 
addition of the sterile liquid carrier immediately prior to use. 

The vaccine formulation of the invention may also include adjuvant systems for 
enhancing the immunogenicity of the formulation. Preferably the adjuvant system 
raises preferentially a TH1 type of response. 

An immune response may be broadly distinguished into two extreme categories, being 
humoral or cell-mediated immune responses (traditionally characterised by antibody and 
cellular effector mechanisms of protection respectively). These categories of response 
have been termed TH1-type responses (cell-mediated response), and TH2-type immune 
responses (humoral response). 

Extreme TH1-type immune responses may be characterised by the generation of antigen 
specific, haplotype restricted cytotoxic T lymphocytes, and natural killer cell responses. 
In mice TH1-type responses are often characterised by the generation of antibodies of 
the IgG2a subtype, whilst in the human these correspond to IgG1 type antibodies. 
TH2-type immune responses are characterised by the generation of a broad range of 
immunoglobulin isotypes including in mice IgG1, IgA, and IgM. 

It can be considered that the driving force behind the development of these two types of 
immune responses are cytokines. High levels of TH1-type cytokines tend to favour the 
induction of cell mediated immune responses to the given antigen, whilst high levels of 
TH2-type cytokines tend to favour the induction of humoral immune responses to the 
antigen. 

The distinction of TH1 and TH2-type immune responses is not absolute. In reality an 
individual will support an immune response which is described as being predominantly 
TH1 or predominantly TH2. However, it is often convenient to consider the families of 
cytokines in terms of that described in murine CD4 +ve T cell clones by Mosmann and 
Coffman (Mosmann, T.R. and Coffman, R.L. (1989) TH1 and TH2 cells: different 
patterns of lymphokine secretion lead to different functional properties. Annual Review 
of Immunology, 7, p145-173). Traditionally, TH1-type responses are associated with 
the production of the IFN-γ and IL-2 cytokines by T-lymphocytes. Other cytokines 
often directly associated with the induction of TH1-type immune responses are not 
produced by T-cells, such as IL-12. In contrast, TH2-type responses are associated with 
the secretion of IL-4, IL-5, IL-6 and IL-13. 

It is known that certain vaccine adjuvants are particularly suited to the stimulation of 
either TH1 or TH2-type cytokine responses. Traditionally the best indicators of the 
TH1:TH2 balance of the immune response after a vaccination or infection include 
direct measurement of the production of TH1 or TH2 cytokines by T lymphocytes in 
vitro after restimulation with antigen, and/or the measurement of the IgG1:IgG2a ratio 
of antigen specific antibody responses. 

Thus, a TH1-type adjuvant is one which preferentially stimulates isolated T-cell 
populations to produce high levels of TH1-type cytokines when re-stimulated with 
antigen in vitro, and promotes development of both CD8+ cytotoxic T lymphocytes and 
antigen specific immunoglobulin responses associated with TH1-type isotype. 

Adjuvants which are capable of preferential stimulation of the TH1 cell response are 
described in International Patent Application No. WO 94/00153 and WO 95/17209. 

3 De-O-acylated monophosphoryl lipid A (3D-MPL) is one such adjuvant. This is 
known from GB 2220211 (Ribi). Chemically it is a mixture of 3 De-O-acylated 
monophosphoryl lipid A with 4, 5 or 6 acylated chains and is manufactured by Ribi 
Immunochem, Montana. A preferred form of 3 De-O-acylated monophosphoryl lipid 
A is disclosed in European Patent 0 689 454 B1 (SmithKline Beecham Biologicals SA). 

Preferably, the particles of 3D-MPL are small enough to be sterile filtered through a 
0.22 micron membrane (European Patent number 0 689 454). 

3D-MPL will be present in the range of 10 µg - 100 µg, preferably 25-50 µg per dose, 
wherein the antigen will typically be present in a range of 2-50 µg per dose. 

Another preferred adjuvant comprises QS21, an HPLC purified non-toxic fraction 
derived from the bark of Quillaja Saponaria Molina. Optionally this may be admixed 
with 3 De-O-acylated monophosphoryl lipid A (3D-MPL), optionally together with a 
carrier. 

The method of production of QS21 is disclosed in US patent No. 5,057,540. 

Non-reactogenic adjuvant formulations containing QS21 have been described 
previously (WO 96/33739). Such formulations comprising QS21 and cholesterol have 
been shown to be successful TH1 stimulating adjuvants when formulated together with 
an antigen. 

Further adjuvants which are preferential stimulators of TH1 cell response include 
immunomodulatory oligonucleotides, for example unmethylated CpG sequences as 
disclosed in WO 96/02555. 

Combinations of different TH1 stimulating adjuvants, such as those mentioned 
hereinabove, are also contemplated as providing an adjuvant which is a preferential 
stimulator of TH1 cell response. For example, QS21 can be formulated together with 
3D-MPL. The ratio of QS21 : 3D-MPL will typically be in the order of 1 : 10 to 10 : 1; 
preferably 1 : 5 to 5 : 1 and often substantially 1 : 1. The preferred range for optimal 
synergy is 2.5 : 1 to 1 : 1 3D-MPL : QS21. 

Preferably a carrier is also present in the vaccine composition according to the 
invention. The carrier may be an oil in water emulsion, or an aluminium salt, such as 
aluminium phosphate or aluminium hydroxide. 

A preferred oil-in-water emulsion comprises a metabolisible oil, such as squalene, alpha 
tocopherol and Tween 80. In a particularly preferred aspect the antigens in the vaccine 
composition according to the invention are combined with QS21 and 3D-MPL in such 
an emulsion. Additionally the oil in water emulsion may contain Span 85 and/or 
lecithin and/or tricaprylin. 

Typically for human administration QS21 and 3D-MPL will be present in a vaccine in 
the range of 1 µg - 200 µg, such as 10-100 µg, preferably 10 µg - 50 µg per dose. 
Typically the oil in water will comprise from 2 to 10% squalene, from 2 to 10% alpha 
tocopherol and from 0.3 to 3% Tween 80. Preferably the ratio of squalene : alpha 
tocopherol is equal to or less than 1 as this provides a more stable emulsion. Span 85 
may also be present at a level of 1%. In some cases it may be advantageous that the 
vaccines of the present invention will further contain a stabiliser. 
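
The typical composition ranges stated above can be expressed as a simple check. The following is only an illustrative sketch: the class and function names are hypothetical, and the percentages are taken as plain numbers without assuming whether they are expressed by weight or by volume.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Emulsion:
    """Hypothetical record of an oil in water emulsion, in percent."""
    squalene_pct: float
    alpha_tocopherol_pct: float
    tween80_pct: float


def within_typical_ranges(e: Emulsion) -> bool:
    """True if each component lies in the typical range stated in the text."""
    return (2.0 <= e.squalene_pct <= 10.0
            and 2.0 <= e.alpha_tocopherol_pct <= 10.0
            and 0.3 <= e.tween80_pct <= 3.0)


def has_preferred_ratio(e: Emulsion) -> bool:
    """True if squalene : alpha tocopherol is equal to or less than 1."""
    return e.alpha_tocopherol_pct > 0 and e.squalene_pct <= e.alpha_tocopherol_pct


candidate = Emulsion(squalene_pct=5.0, alpha_tocopherol_pct=5.0, tween80_pct=2.0)
print(within_typical_ranges(candidate), has_preferred_ratio(candidate))
```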

Non-toxic oil in water emulsions preferably contain a non-toxic oil, e.g. squalane or 
squalene, an emulsifier, e.g. Tween 80, in an aqueous carrier. The aqueous carrier may 
be, for example, phosphate buffered saline. 

A particularly potent adjuvant formulation involving QS21, 3D-MPL and tocopherol 
in an oil in water emulsion is described in WO 95/17210. 

The present invention also provides a polyvalent vaccine composition comprising a 
vaccine formulation of the invention in combination with other antigens, in particular 
antigens useful for treating cancers, autoimmune diseases and related conditions. Such a 
polyvalent vaccine composition may include a TH-1 inducing adjuvant as hereinbefore 
described. 

While the invention has been described with reference to certain BASB034 polypeptides 
and polynucleotides, it is to be understood that this covers fragments of the naturally 
occurring polypeptides and polynucleotides, and similar polypeptides and polynucleotides 
with additions, deletions or substitutions which do not substantially affect the 
immunogenic properties of the recombinant polypeptides or polynucleotides. 

Compositions, kits and administration 

In a further aspect of the invention there are provided compositions comprising a BASB034 
polynucleotide and/or a BASB034 polypeptide for administration to a cell or to a 
multicellular organism. 

The invention also relates to compositions comprising a polynucleotide and/or a 
polypeptide discussed herein or their agonists or antagonists. The polypeptides and 
polynucleotides of the invention may be employed in combination with a non-sterile or 
sterile carrier or carriers for use with cells, tissues or organisms, such as a pharmaceutical 
carrier suitable for administration to an individual. Such compositions comprise, for 
instance, a media additive or a therapeutically effective amount of a polypeptide and/or 
polynucleotide of the invention and a pharmaceutically acceptable carrier or excipient. Such 
carriers may include, but are not limited to, saline, buffered saline, dextrose, water, glycerol, 
ethanol and combinations thereof. The formulation should suit the mode of administration. 
The invention further relates to diagnostic and pharmaceutical packs and kits comprising 
one or more containers filled with one or more of the ingredients of the aforementioned 
compositions of the invention. 

Polypeptides, polynucleotides and other compounds of the invention may be employed 
alone or in conjunction with other compounds, such as therapeutic compounds. 

The pharmaceutical compositions may be administered in any effective, convenient manner 
including, for instance, administration by topical, oral, anal, vaginal, intravenous, 
intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal routes among others. 

In therapy or as a prophylactic, the active agent may be administered to an individual as 
an injectable composition, for example as a sterile aqueous dispersion, preferably 
isotonic. 

In a further aspect, the present invention provides for pharmaceutical compositions 
comprising a therapeutically effective amount of a polypeptide and/or polynucleotide, such 
as the soluble form of a polypeptide and/or polynucleotide of the present invention, agonist 
or antagonist peptide or small molecule compound, in combination with a pharmaceutically 
acceptable carrier or excipient. Such carriers include, but are not limited to, saline, buffered 
saline, dextrose, water, glycerol, ethanol, and combinations thereof. The invention further 
relates to pharmaceutical packs and kits comprising one or more containers filled with one 
or more of the ingredients of the aforementioned compositions of the invention. 
Polypeptides, polynucleotides and other compounds of the present invention may be 
employed alone or in conjunction with other compounds, such as therapeutic compounds. 

The composition will be adapted to the route of administration, for instance by a systemic or 
an oral route. Preferred forms of systemic administration include injection, typically by 
intravenous injection. Other injection routes, such as subcutaneous, intramuscular, or 
intraperitoneal, can be used. Alternative means for systemic administration include 
transmucosal and transdermal administration using penetrants such as bile salts or fusidic 
acids or other detergents. In addition, if a polypeptide or other compounds of the present 
invention can be formulated in an enteric or an encapsulated formulation, oral 
administration may also be possible. Administration of these compounds may also be 
topical and/or localized, in the form of salves, pastes, gels, solutions, powders and the like. 

For administration to mammals, and particularly humans, it is expected that the daily 
dosage level of the active agent will be from 0.01 mg/kg to 10 mg/kg, typically around 1 
mg/kg. The physician in any event will determine the actual dosage which will be most 
suitable for an individual and will vary with the age, weight and response of the particular 
individual. The above dosages are exemplary of the average case. There can, of course, 
be individual instances where higher or lower dosage ranges are merited, and such are 
within the scope of this invention. 
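
As a purely arithmetic illustration of the exemplary range above, the per-kilogram figures can be scaled by body weight. The function name and the 70 kg example weight are hypothetical and are not taken from the specification; as stated above, the actual dosage is determined by the physician.

```python
def daily_dose_range_mg(weight_kg: float,
                        low_mg_per_kg: float = 0.01,
                        typical_mg_per_kg: float = 1.0,
                        high_mg_per_kg: float = 10.0):
    """Scale the exemplary mg/kg levels from the text by body weight.

    Returns (low, typical, high) in mg per day.
    """
    if weight_kg <= 0:
        raise ValueError("weight_kg must be positive")
    return (low_mg_per_kg * weight_kg,
            typical_mg_per_kg * weight_kg,
            high_mg_per_kg * weight_kg)


# Hypothetical 70 kg individual.
low, typical, high = daily_dose_range_mg(70.0)
print(f"{low:.2f} mg to {high:.0f} mg per day, typically about {typical:.0f} mg")
```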

The dosage range required depends on the choice of peptide, the route of administration, the 
nature of the formulation, the nature of the subject's condition, and the judgment of the 
attending practitioner. Suitable dosages, however, are in the range of 0.1-100 µg/kg of 
subject. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants may be 
employed to enhance the immune response. A suitable unit dose for vaccination is 0.5-5 
microgram/kg of antigen, and such dose is preferably administered 1-3 times and with an 
interval of 1-3 weeks. With the indicated dose range, no adverse toxicological effects will 
be observed with the compounds of the invention which would preclude their 
administration to suitable individuals. 

Wide variations in the needed dosage, however, are to be expected in view of the variety of 
compounds available and the differing efficiencies of various routes of administration. For 
example, oral administration would be expected to require higher dosages than 
administration by intravenous injection. Variations in these dosage levels can be adjusted 
using standard empirical routines for optimization, as is well understood in the art. 

Sequence Databases, Sequences in a Tangible Medium, and Algorithms 

Polynucleotide and polypeptide sequences form a valuable information resource with which 
to determine their 2- and 3-dimensional structures as well as to identify further sequences of 
similar homology. These approaches are most easily facilitated by storing the sequence in a 
computer readable medium and then using the stored data in a known macromolecular 
structure program or to search a sequence database using well known searching tools, such 
as the GCG program package. 

Also provided by the invention are methods for the analysis of character sequences or 
strings, particularly genetic sequences or encoded protein sequences. Preferred methods 
of sequence analysis include, for example, methods of sequence homology analysis, such 
as identity and similarity analysis, DNA, RNA and protein structure analysis, sequence 
assembly, cladistic analysis, sequence motif analysis, open reading frame determination, 
nucleic acid base calling, codon usage analysis, nucleic acid base trimming, and 
sequencing chromatogram peak analysis. 
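
Two of the listed analyses, open reading frame determination and codon usage analysis, can be sketched in a few lines. This is a minimal illustration only, under stated assumptions: an ATG start codon, the standard TAA/TAG/TGA stop codons, and the forward strand only. The function names are hypothetical and the example sequence is invented.

```python
from collections import Counter

STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard stop codons


def find_orfs(dna: str, min_codons: int = 2):
    """Find ATG...stop open reading frames on the forward strand.

    Returns (start, end, frame) tuples; end is exclusive and includes the
    stop codon. min_codons counts the codons before the stop, ATG included.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None
    return orfs


def codon_usage(orf: str) -> Counter:
    """Count each codon in an in-frame coding sequence."""
    return Counter(orf[i:i + 3] for i in range(0, len(orf) - 2, 3))


example = "CCATGAAATTTTAGGG"  # invented sequence, not SEQ ID NO:1
for start, end, frame in find_orfs(example):
    print(start, end, frame, dict(codon_usage(example[start:end])))
```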

A computer based method is provided for performing homology identification. This 
method comprises the steps of: providing a first polynucleotide sequence comprising the 
sequence of a polynucleotide of the invention in a computer readable medium; and 
comparing said first polynucleotide sequence to at least one second polynucleotide or 
polypeptide sequence to identify homology. 

A computer based method is also provided for performing homology identification, said 
method comprising the steps of: providing a first polypeptide sequence comprising the 
sequence of a polypeptide of the invention in a computer readable medium; and 
comparing said first polypeptide sequence to at least one second polynucleotide or 
polypeptide sequence to identify homology. 
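
The two steps of such a method (providing a first sequence in a computer readable medium, then comparing it to at least one second sequence) can be sketched as follows. This is an illustration under stated assumptions, not the method of the invention: the first sequence is assumed to be stored as a FASTA file, and a crude shared k-mer score stands in for a real comparison tool such as those named in this specification. All function names and the threshold are hypothetical.

```python
from pathlib import Path


def read_fasta(path) -> str:
    """Return the sequence of the first record in a FASTA file, uppercased."""
    parts = []
    seen_header = False
    for line in Path(path).read_text().splitlines():
        line = line.strip()
        if line.startswith(">"):
            if seen_header:
                break  # stop at the second record
            seen_header = True
        elif line:
            parts.append(line.upper())
    return "".join(parts)


def kmers(seq: str, k: int) -> set:
    """Set of all overlapping substrings of length k."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def kmer_similarity(a: str, b: str, k: int = 3) -> float:
    """Jaccard index of the k-mer sets of a and b (0.0 to 1.0)."""
    ka, kb = kmers(a, k), kmers(b, k)
    if not ka or not kb:
        return 0.0
    return len(ka & kb) / len(ka | kb)


def identify_homologs(first: str, candidates: dict, k: int = 3,
                      threshold: float = 0.5) -> list:
    """Names of candidate sequences whose score meets the threshold."""
    return [name for name, seq in candidates.items()
            if kmer_similarity(first, seq, k) >= threshold]
```

A k-mer score of this kind is only a coarse screen; an alignment-based identity calculation, as described under DEFINITIONS, would normally follow.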

All publications and references, including but not limited to patents and patent 
applications, cited in this specification are herein incorporated by reference in their 
entirety as if each individual publication or reference were specifically and individually 
indicated to be incorporated by reference herein as being fully set forth. Any patent 
application to which this application claims priority is also incorporated by reference 
herein in its entirety in the manner described above for publications and references. 

DEFINITIONS 

"Identity," as known in the art, is a relationship between two or more polypeptide sequences 
or two or more polynucleotide sequences, as the case may be, as determined by comparing 
the sequences. In the art, "identity" also means the degree of sequence relatedness between 
polypeptide or polynucleotide sequences, as the case may be, as determined by the match 
between strings of such sequences. "Identity" can be readily calculated by known 
methods, including but not limited to those described in (Computational Molecular 
Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., 
Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, 
G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., 
eds., M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SIAM J. 
Applied Math., 48: 1073 (1988)). Methods to determine identity are designed to give the 
largest match between the sequences tested. Moreover, methods to determine identity are 
codified in publicly available computer programs. Computer program methods to 
determine identity between two sequences include, but are not limited to, the GAP 
program in the GCG program package (Devereux, J., et al., Nucleic Acids Research 12(1): 
387 (1984)), BLASTP, BLASTN (Altschul, S.F. et al., J. Molec. Biol. 215: 403-410 
(1990)), and FASTA (Pearson and Lipman, Proc. Natl. Acad. Sci. USA 85: 2444-2448 
(1988)). The BLAST family of programs is publicly available from NCBI and other 
sources (BLAST Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, MD 20894; 
Altschul, S., et al., J. Mol. Biol. 215: 403-410 (1990)). The well known Smith Waterman 
algorithm may also be used to determine identity. 

Parameters for polypeptide sequence comparison include the following: 

Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
Comparison matrix: BLOSUM62 from Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915-10919 (1992) 
Gap Penalty: 8 
Gap Length Penalty: 2 

A program useful with these parameters is publicly available as the "gap" program from 
Genetics Computer Group, Madison WI. The aforementioned parameters are the default 
parameters for peptide comparisons (along with no penalty for end gaps). 

Parameters for polynucleotide comparison include the following: 
Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
Comparison matrix: matches = +10, mismatch = 0 
Gap Penalty: 50 
Gap Length Penalty: 3 

Available as: The "gap" program from Genetics Computer Group, Madison WI. These 
are the default parameters for nucleic acid comparisons. 
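
To make the polynucleotide parameters concrete, the following is a minimal sketch of a Needleman and Wunsch style global alignment score using match = +10, mismatch = 0, gap penalty = 50 and gap length penalty = 3. It is not the "gap" program itself. Two assumptions are made: a gap of length L is charged 50 + 3 x L, and end gaps are penalized like any other gap, which is a simplification. The function name is hypothetical.

```python
NEG_INF = float("-inf")


def gap_align_score(a: str, b: str, match: int = 10, mismatch: int = 0,
                    gap_open: int = 50, gap_extend: int = 3):
    """Best global alignment score with affine gap costs.

    M[i][j]: a[i-1] aligned to b[j-1]; X[i][j]: a[i-1] aligned to a gap;
    Y[i][j]: b[j-1] aligned to a gap.
    """
    n, m = len(a), len(b)
    M = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    X = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    Y = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0
    for i in range(1, n + 1):          # leading gap in b
        X[i][0] = -(gap_open + gap_extend * i)
    for j in range(1, m + 1):          # leading gap in a
        Y[0][j] = -(gap_open + gap_extend * j)
    new_gap = gap_open + gap_extend    # cost of the first position of a gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            M[i][j] = s + max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1])
            X[i][j] = max(M[i - 1][j] - new_gap,
                          X[i - 1][j] - gap_extend,
                          Y[i - 1][j] - new_gap)
            Y[i][j] = max(M[i][j - 1] - new_gap,
                          Y[i][j - 1] - gap_extend,
                          X[i][j - 1] - new_gap)
    return max(M[n][m], X[n][m], Y[n][m])


print(gap_align_score("ACGT", "ACGT"))
print(gap_align_score("AAAA", "AA"))
```

With a gap opening cost this large relative to the match score, the sketch prefers mismatches to gaps unless a gap rescues at least six matches.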

Preferred meanings of "identity" for polynucleotides and polypeptides, as the case may 
be, are provided in (1) and (2) below. 

(1) Polynucleotide embodiments further include an isolated polynucleotide 
comprising a polynucleotide sequence having at least a 50, 60, 70, 80, 85, 90, 95, 97 or 
100% identity to the reference sequence of SEQ ID NO:1, wherein said polynucleotide 
sequence may be identical to the reference sequence of SEQ ID NO:1 or may include up 
to a certain integer number of nucleotide alterations as compared to the reference 
sequence, wherein said alterations are selected from the group consisting of at least one 
nucleotide deletion, substitution, including transition and transversion, or insertion, and 
wherein said alterations may occur at the 5' or 3' terminal positions of the reference 
nucleotide sequence or anywhere between those terminal positions, interspersed either 
individually among the nucleotides in the reference sequence or in one or more 
contiguous groups within the reference sequence, and wherein said number of nucleotide 
alterations is determined by multiplying the total number of nucleotides in SEQ ID NO:1 
by the integer defining the percent identity divided by 100 and then subtracting that 
product from said total number of nucleotides in SEQ ID NO:1, or: 

$$n_n \leq x_n - (x_n \bullet y),$$

wherein $n_n$ is the number of nucleotide alterations, $x_n$ is the total number of nucleotides 
in SEQ ID NO:1, y is 0.50 for 50%, 0.60 for 60%, 0.70 for 70%, 0.80 for 80%, 0.85 for 
85%, 0.90 for 90%, 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and • is the symbol for 
the multiplication operator, and wherein any non-integer product of $x_n$ and y is rounded 
down to the nearest integer prior to subtracting it from $x_n$. Alterations of a polynucleotide 
sequence encoding the polypeptide of SEQ ID NO:2 may create nonsense, missense or 
frameshift mutations in this coding sequence and thereby alter the polypeptide encoded by 
the polynucleotide following such alterations. 
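
The bound on the number of alterations can be computed directly. The sketch below takes the percent identity as an integer, as the text does, so that the rounding down of the product is exact. The function name is hypothetical, and the sequence lengths are invented examples because the length of SEQ ID NO:1 is not given in this passage.

```python
def max_alterations(total_residues: int, percent_identity: int) -> int:
    """Largest number of alterations allowed at a given percent identity.

    Implements x - floor(x * y), with y = percent_identity / 100, using
    integer arithmetic so that the rounding down is exact.
    """
    if total_residues < 0 or not 0 <= percent_identity <= 100:
        raise ValueError("invalid length or percent identity")
    return total_residues - (total_residues * percent_identity) // 100


# Invented example lengths.
print(max_alterations(1000, 95))
print(max_alterations(333, 97))
```

For a hypothetical reference of 333 residues at 97% identity, the product 323.01 is rounded down to 323, so up to 10 alterations are allowed.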

By way of example, a polynucleotide sequence of the present invention may be identical 
to the reference sequence of SEQ ID NO: 1, that is it may be 100% identical, or it may 
include up to a certain integer number of nucleic acid alterations as compared to the 
reference sequence such that the percent identity is less than 100% identity. Such 
alterations are selected from the group consisting of at least one nucleic acid deletion, 
substitution, including transition and transversion, or insertion, and wherein said 
alterations may occur at the 5' or 3' terminal positions of the reference polynucleotide 
sequence or anywhere between those terminal positions, interspersed either individually 
among the nucleic acids in the reference sequence or in one or more contiguous groups 
within the reference sequence. The number of nucleic acid alterations for a given percent 
identity is determined by multiplying the total number of nucleic acids in SEQ ID NO:1 
by the integer defining the percent identity divided by 100 and then subtracting that 
product from said total number of nucleic acids in SEQ ID NO:1, or: 

$$n_n \leq x_n - (x_n \bullet y),$$

wherein $n_n$ is the number of nucleic acid alterations, $x_n$ is the total number of nucleic 
acids in SEQ ID NO:1, y is, for instance 0.70 for 70%, 0.80 for 80%, 0.85 for 85% etc., • 
is the symbol for the multiplication operator, and wherein any non-integer product of $x_n$ 
and y is rounded down to the nearest integer prior to subtracting it from $x_n$. 

(2) Polypeptide embodiments further include an isolated polypeptide comprising a 
polypeptide having at least a 50, 60, 70, 80, 85, 90, 95, 97 or 100% identity to a 
polypeptide reference sequence of SEQ ID NO:2, wherein said polypeptide sequence may 
be identical to the reference sequence of SEQ ID NO:2 or may include up to a certain 
integer number of amino acid alterations as compared to the reference sequence, wherein 
said alterations are selected from the group consisting of at least one amino acid deletion, 
substitution, including conservative and non-conservative substitution, or insertion, and 
wherein said alterations may occur at the amino- or carboxy-terminal positions of the 
reference polypeptide sequence or anywhere between those terminal positions, 
interspersed either individually among the amino acids in the reference sequence or in one 
or more contiguous groups within the reference sequence, and wherein said number of 
amino acid alterations is determined by multiplying the total number of amino acids in 
SEQ ID NO:2 by the integer defining the percent identity divided by 100 and then 
subtracting that product from said total number of amino acids in SEQ ID NO:2, or: 

$$n_a \leq x_a - (x_a \bullet y),$$

wherein $n_a$ is the number of amino acid alterations, $x_a$ is the total number of amino acids 
in SEQ ID NO:2, y is 0.50 for 50%, 0.60 for 60%, 0.70 for 70%, 0.80 for 80%, 0.85 for 
85%, 0.90 for 90%, 0.95 for 95%, 0.97 for 97% or 1.00 for 100%, and • is the symbol for 
the multiplication operator, and wherein any non-integer product of $x_a$ and y is rounded 
down to the nearest integer prior to subtracting it from $x_a$. 

By way of example, a polypeptide sequence of the present invention may be identical to 
the reference sequence of SEQ ID NO:2, that is it may be 100% identical, or it may 
include up to a certain integer number of amino acid alterations as compared to the 
reference sequence such that the percent identity is less than 100% identity. Such 
alterations are selected from the group consisting of at least one amino acid deletion,
substitution, including conservative and non-conservative substitution, or insertion, and
wherein said alterations may occur at the amino- or carboxy-terminal positions of the 
reference polypeptide sequence or anywhere between those terminal positions, 
interspersed either individually among the amino acids in the reference sequence or in one 
or more contiguous groups within the reference sequence. The number of amino acid 
alterations for a given % identity is determined by multiplying the total number of amino 
acids in SEQ ID NO:2 by the integer defining the percent identity divided by 100 and 
then subtracting that product from said total number of amino acids in SEQ ID NO:2, or:

n_a ≤ x_a - (x_a • y),

wherein n_a is the number of amino acid alterations, x_a is the total number of amino acids
in SEQ ID NO:2, y is, for instance, 0.70 for 70%, 0.80 for 80%, 0.85 for 85% etc., and • is
the symbol for the multiplication operator, and wherein any non-integer product of x_a and
y is rounded down to the nearest integer prior to subtracting it from x_a.

"Individual(s)," when used herein with reference to an organism, means a multicellular
eukaryote, including, but not limited to a metazoan, a mammal, an ovid, a bovid, a
simian, a primate, and a human.

"Isolated" means altered "by the hand of man" from its natural state, i.e., if it occurs in
nature, it has been changed or removed from its original environment, or both. For example,
a polynucleotide or a polypeptide naturally present in a living organism is not "isolated," but 
the same polynucleotide or polypeptide separated from the coexisting materials of its natural 
state is "isolated", as the term is employed herein. Moreover, a polynucleotide or 
polypeptide that is introduced into an organism by transformation, genetic manipulation or 
by any other recombinant method is "isolated" even if it is still present in said organism, 
which organism may be living or non-living.

"Polynucleotide(s)" generally refers to any polyribonucleotide or polydeoxyribonucleotide,
which may be unmodified RNA or DNA or modified RNA or DNA including single and 
double-stranded regions. 

"Variant" refers to a polynucleotide or polypeptide that differs from a reference 
polynucleotide or polypeptide, but retains essential properties. A typical variant of a 
polynucleotide differs in nucleotide sequence from another, reference polynucleotide. 
Changes in the nucleotide sequence of the variant may or may not alter the amino acid 
sequence of a polypeptide encoded by the reference polynucleotide. Nucleotide 
changes may result in amino acid substitutions, additions, deletions, fusions and 
truncations in the polypeptide encoded by the reference sequence, as discussed below. 
A typical variant of a polypeptide differs in amino acid sequence from another, 
reference polypeptide. Generally, differences are limited so that the sequences of the 
reference polypeptide and the variant are closely similar overall and, in many regions, 
identical. A variant and reference polypeptide may differ in amino acid sequence by 
one or more substitutions, additions, deletions in any combination. A substituted or 
inserted amino acid residue may or may not be one encoded by the genetic code. A 
variant of a polynucleotide or polypeptide may be naturally occurring, such as an
allelic variant, or it may be a variant that is not known to occur naturally. Non-naturally 
occurring variants of polynucleotides and polypeptides may be made by mutagenesis 
techniques or by direct synthesis. 

"Disease(s)" means any disease caused by or related to infection by a bacteria, including, 
for example, otitis media in infants and children, pneumonia in the elderly, sinusitis,
nosocomial infections and invasive diseases, chronic otitis media with hearing loss, fluid
accumulation in the middle ear, auditive nerve damage, delayed speech learning, infection
of the upper respiratory tract and inflammation of the middle ear.

EXAMPLES:

The examples below are carried out using standard techniques, which are well known and 
routine to those of skill in the art, except where otherwise described in detail. The examples
are illustrative, but do not limit the invention. 

Example 1: 

Discovery and confirmatory DNA sequencing of the BASB034 gene from 
Moraxella catarrhalis strain ATCC 43617. 

The BASB034 gene was first discovered in the Incyte PathoSeq data base containing 
unfinished genomic DNA sequences of the Moraxella catarrhalis strain ATCC 43617 
(also referred to as strain MC2931). The translation of the BASB034 polynucleotide
sequence showed significant similarity (42 % identity in a 214 amino acid overlap) to a
Klebsiella pneumoniae outer membrane phospholipase A protein. 

The sequence of the BASB034 gene was further confirmed experimentally. For this
purpose, genomic DNA was extracted from 10^10 cells of M. catarrhalis (strain
ATCC 43617) using the QIAGEN genomic DNA extraction kit (Qiagen GmbH), and
1 µg of this material was submitted to Polymerase Chain Reaction DNA amplification
using primers E481 124 (5'-GAT TTA AGA GTA TGT TAT GAT G-3')
[SEQ ID NO:9] and E481 125 (5'-GTA TGG GTT GAT CAA ATA CAG-3')
[SEQ ID NO:10]. This PCR product was purified on a Biorobot 9600 (Qiagen GmbH)
apparatus and subjected to DNA sequencing using the Big Dye Cycle Sequencing kit
(Perkin-Elmer) and an ABI 377/PRISM DNA sequencer. DNA sequencing was
performed on both strands with a redundancy of 2 and the full-length sequence was
assembled using the Sequencher™ software (Applied Biosystems). The resulting DNA
sequence turned out to be 100% identical to SEQ ID NO:1.

Example 2:

Variability analysis of the BASB034 gene among several Moraxella catarrhalis
strains.

2A: Restriction Fragment Length Analysis (RFLP).

Genomic DNA was extracted from 16 M. catarrhalis strains (presented in Table 1) as
described below. M. catarrhalis was streaked for single colonies on BHI agar plates and
grown overnight at 37 °C. Three or four single colonies were picked and used to
inoculate a ~1.5 ml BHI (Brain-heart infusion) broth seed culture which was grown
overnight in a shaking incubator, ~300 rpm, at 37 °C. A 500 ml erlenmeyer flask
containing ~150 ml of BHI broth was inoculated with the seed culture and grown for
~12-16 hours at 37 °C in a shaking incubator, ~175 rpm, to generate cell mass for DNA
isolation. Cells were collected by centrifugation in a Sorvall GSA rotor at ~2000 x g
for 15 minutes at room temperature. The supernatant was removed and the cell pellet
suspended in ~5.0 ml of sterile water. An equal volume of lysis buffer (200 mM NaCl,
20 mM EDTA, 40 mM Tris-HCl, pH 8.0, 0.5% (w/v) SDS, 0.5% (v/v) 2-mercaptoethanol,
and 250 µg/ml of proteinase K) was added and the cells suspended by
gentle agitation and trituration. The cell suspension was then incubated ~12 hours at
50 °C to lyse the bacteria and liberate chromosomal DNA. Proteinaceous material was
precipitated by the addition of 5.0 ml of saturated NaCl (~6.0 M, in sterile water) and
centrifugation at ~5,500 x g in a Sorvall SS34 rotor at room temperature. Chromosomal
DNA was precipitated from the cleared supernatant by the addition of two volumes of
100 % ethanol. Aggregated DNA was collected and washed using gentle agitation in a
small volume of a 70 % ethanol solution. Purified chromosomal DNA was suspended
in sterile water and allowed to dissolve/disperse overnight at 4 °C by gentle rocking.
The concentration of dissolved DNA was determined spectrophotometrically at 260 nm
using an extinction coefficient of 1.0 O.D. unit = ~50 µg/ml.
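
The spectrophotometric conversion in the last step of the extraction can be sketched as follows. This is an illustrative calculation only: the function name, the dilution_factor parameter and the example reading are assumptions, and the only value taken from the text is the conversion of 1.0 O.D. unit at 260 nm to about 50 µg/ml.

```python
UG_PER_ML_PER_OD260 = 50.0  # from the text: 1.0 O.D. unit at 260 nm is about 50 ug/ml


def dna_concentration_ug_per_ml(a260: float, dilution_factor: float = 1.0) -> float:
    """Estimate the DNA concentration of the undiluted sample in ug/ml.

    a260            -- absorbance of the measured (possibly diluted) sample at 260 nm
    dilution_factor -- how many fold the sample was diluted before reading
                       (hypothetical parameter, 1.0 means undiluted)
    """
    if a260 < 0 or dilution_factor <= 0:
        raise ValueError("a260 must be >= 0 and dilution_factor must be > 0")
    return a260 * UG_PER_ML_PER_OD260 * dilution_factor


if __name__ == "__main__":
    # Hypothetical reading: a 20-fold dilution that reads 0.25 at 260 nm.
    print(dna_concentration_ug_per_ml(0.25, dilution_factor=20))
```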
This material was next submitted to PCR amplification using the MC-Pla-BamF (5'-
AAG GGC CCA ATT ACG CAG AGG GGA TCC CAA GCT GTA CCA AAT CCT
GTG GCA TTT GTT G-3') [SEQ ID NO:11] and MC-Pla-SalRC (5'-AAG GGC CCA
ATT ACG CAG AGG GTC GAC TTA TTA TAG ACC CAT CCA GTC GTT AAG
CAT AAG-3') [SEQ ID NO:12] oligonucleotides. The corresponding BASB034 gene
amplicons were then subjected independently to hydrolysis using restriction enzymes
(HphI, AluI, RsaI, EcoRV, Sau3AI) and restriction products were separated by agarose
or polyacrylamide gel electrophoresis using standard molecular biology procedures as
described in "Molecular Cloning, a Laboratory Manual, Second Edition, Eds:
Sambrook, Fritsch & Maniatis, Cold Spring Harbor press 1989". The photographs of
the resulting electrophoresis gels are displayed in Figure 1. For each strain, RFLP
patterns corresponding to the 6 restriction enzymes were scored and combined. Groups
of strains sharing identical combination of RFLP patterns were then defined. Using this
methodology, the strains tested in this study fell into 3 genomic groups (Group 1:
Mc2931, Mc2904, Mc2905, Mc2907, Mc2909, Mc2910, Mc2911, Mc2912, Mc2926,
Mc2931, Mc2956, Mc2969, Mc2975; Group 2: Mc2906, Mc2913; Group 3: Mc2908).
(Mc2960 failed to be amplified and consequently was not classified.) These data
support that the Moraxella catarrhalis population used in this study displays limited
nucleotide sequence diversity for the BASB034 gene.
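
The grouping step described above (score one pattern per enzyme for each strain, combine the patterns, then collect strains whose combinations are identical) can be sketched as below. This is a hypothetical illustration: the function name, the strain labels S1 to S4 and the pattern labels "A" and "B" are invented for the example and are not the scores read from Figure 1.

```python
from collections import defaultdict


def group_strains_by_rflp(patterns):
    """Group strains that share an identical combination of RFLP patterns.

    patterns -- dict mapping strain name -> dict mapping enzyme name -> pattern label
    Returns a list of groups (each a sorted list of strain names), largest group first.
    """
    groups = defaultdict(list)
    for strain, per_enzyme in patterns.items():
        # The combined pattern is the tuple of (enzyme, label) pairs in a fixed order.
        combined = tuple(sorted(per_enzyme.items()))
        groups[combined].append(strain)
    return sorted(
        (sorted(members) for members in groups.values()),
        key=lambda members: (-len(members), members),
    )


# Made-up scores for four hypothetical strains and two enzymes.
toy_patterns = {
    "S1": {"HphI": "A", "AluI": "A"},
    "S2": {"HphI": "A", "AluI": "A"},
    "S3": {"HphI": "B", "AluI": "A"},
    "S4": {"HphI": "A", "AluI": "A"},
}

if __name__ == "__main__":
    for number, members in enumerate(group_strains_by_rflp(toy_patterns), start=1):
        print(f"Group {number}: {', '.join(members)}")
```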





2B: DNA sequencing in other strains. 

Using the experimental procedure described in Example 1, the sequence of the
BASB034 gene was also determined for three additional Moraxella catarrhalis strains.
The nucleotide sequences of the BASB034 gene of the strains Mc2908, Mc2913 and
Mc2969, representative of the three genomic groups identified previously, are shown in
SEQ ID NO:3, 5 and 7, respectively. These nucleotide sequences were translated into
amino acid sequences, which are shown in SEQ ID NO:4, 6 and 8, respectively. Using
the MegAlign program from the DNASTAR Lasergene package, a multiple alignment
of the nucleotide sequences of SEQ ID NO:1, 3, 5 and 7 was performed, and is
displayed in Figure 2. A pairwise comparison of identities is summarized in Table 2,
showing that the four BASB034 nucleotide gene sequences are all similar at an identity
level greater than 98 %. Using the same program, a multiple alignment of the protein
sequences of SEQ ID NO:2, 4, 6 and 8 was performed, and is displayed in Figure 3. A
pairwise comparison of identities is summarized in Table 3, showing that the four
BASB034 protein sequences are all similar at an identity level greater than 98 %. Taken
together, these data indicate very strong sequence conservation of the BASB034 gene
among Moraxella catarrhalis strains.
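
For readers who want to reproduce a pairwise identity figure from an alignment, one common convention is sketched below. It is an assumption-laden illustration, not the MegAlign algorithm: the function name is hypothetical, gaps are assumed to be written as "-", columns that are gaps in both sequences are skipped, and the program used in the study may count gapped columns differently.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of compared alignment columns in which both sequences carry the same residue.

    Both inputs must be rows of the same alignment, so they must have equal length.
    Columns where both rows are gaps are ignored; a gap opposite a residue is a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for residue_a, residue_b in zip(aligned_a, aligned_b):
        if residue_a == gap and residue_b == gap:
            continue
        compared += 1
        if residue_a == residue_b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy sequences, not taken from the sequence listing.
    print(percent_identity("ACGTACGTAC", "ACGTACGTAA"))
```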

Table 1: Features of the Moraxella catarrhalis strains used in this study

Strain               Isolated in:   from:
Mc2904               USA            Tympanocentesis
Mc2905               USA            Tympanocentesis
Mc2906               USA            Tympanocentesis
Mc2907               USA            Tympanocentesis
Mc2908               USA            Acute otitis Tympanocentesis
Mc2909               USA            Tympanocentesis
Mc2910               USA            Tympanocentesis
Mc2911               USA            Acute otitis Tympanocentesis
Mc2912               USA            Acute otitis Tympanocentesis
Mc2913               USA            Acute otitis Tympanocentesis
Mc2926               USA            Tympanocentesis
Mc2931/ATCC 43617    USA            Transtracheal aspirate
Mc2956               Finland        Middle ear fluid
Mc2960               Finland        Middle ear fluid
Mc2969               Norway         Nasopharynx (Pharyngitis-Rhinitis)
Mc2975               Norway         Nasopharynx (Rhinitis)

Table 2: Pairwise identities of the BASB034 polynucleotide sequences (in %)

              SeqID No:3   SeqID No:5   SeqID No:7
SeqID No:1    98.7         99.2         99.7
SeqID No:3                 99.3         98.7
SeqID No:5                              99.1


Table 3: Pairwise identities of the BASB034 polypeptide sequences (in %)

              SeqID No:4   SeqID No:6   SeqID No:8
SeqID No:2    98.6         99.3         99.5
SeqID No:4                 98.9         99.1
SeqID No:6                              99.3

Example 3: Construction of Plasmid to Express Recombinant BASB034

A: Cloning of BASB034.

The BamHI and SalI restriction sites engineered into the forward ([SEQ ID NO:11]) and
reverse ([SEQ ID NO:12]) amplification primers, respectively, permitted directional
cloning of a BASB034 PCR product into the commercially available E. coli expression
plasmid pQE30 (QiaGen, ampicillin resistant) such that a mature BASB034 protein
could be expressed as a fusion protein containing a (His)6 affinity chromatography tag
at the N-terminus. The BASB034 PCR product was purified from the amplification
reaction using silica gel-based spin columns (QiaGen) according to the manufacturer's
instructions. To produce the required BamHI and SalI termini necessary for cloning,
purified PCR product was sequentially digested to completion with BamHI and SalI
restriction enzymes as recommended by the manufacturer (Life Technologies).
Following the first restriction digestion, the PCR product was purified via spin column
as above to remove salts and eluted in sterile water prior to the second enzyme
digestion. The digested DNA fragment was again purified using silica gel-based spin
columns prior to ligation with the pQE30 plasmid.

B: Production of Expression Vector. 

To prepare the expression plasmid pQE30 for ligation, it was similarly digested to
completion with both BamHI and SalI and then treated with calf intestinal phosphatase
(CIP, ~0.02 units / pmole of 5' end, Life Technologies) as directed by the manufacturer
to prevent self-ligation. An approximately 5-fold molar excess of the digested fragment
to the prepared vector was used to program the ligation reaction. A standard ~20 µl
ligation reaction (~16°C, ~16 hours), using methods well known in the art, was
performed using T4 DNA ligase (~2.0 units / reaction, Life Technologies). An aliquot
of the ligation (~5 µl) was used to transform electro-competent M15(pREP4) cells
according to methods well known in the art. Following a ~2-3 hour outgrowth period at
37°C in ~1.0 ml of LB broth, transformed cells were plated on LB agar plates
containing kanamycin (50 µg/ml) and ampicillin (100 µg/ml). Both antibiotics were
included in the selection media to ensure that all transformed cells carried both the
pREP4 plasmid (KnR), which carries the lacIq gene necessary for the repression of
expression for IPTG-inducible expression of proteins on pQE30, and the pQE30-
BASB034 plasmid (ApR). Plates were incubated overnight at 37°C for ~16 hours.
Individual KnR / ApR colonies were picked with sterile toothpicks and used to "patch"
inoculate fresh LB KnR / ApR plates as well as a ~1.0 ml LB KnR / ApR broth culture.
Both the patch plates and the broth culture were incubated overnight at 37°C in either a
standard incubator (plates) or a shaking water bath.

A whole cell-based PCR analysis was employed to verify that transformants contained
the BASB034 DNA insert. Here, the ~1.0 ml overnight LB Kn / Ap broth culture was
transferred to a 1.5 ml polypropylene tube and the cells collected by centrifugation in a
Beckman microcentrifuge (~3 min., room temperature, ~12,000 x g). The cell pellet
was suspended in ~200 µl of sterile water and a ~10 µl aliquot used to program a ~50 µl
final volume PCR reaction containing both BASB034 forward and reverse amplification
primers. Final concentrations of the PCR reaction components were essentially the
same as those specified in example 2 except ~5.0 units of Taq polymerase was used.
The initial 95°C denaturation step was increased to 3 minutes to ensure thermal
disruption of the bacterial cells and liberation of plasmid DNA. An ABI Model 9700
thermal cycler and a 32 cycle, three-step thermal amplification profile, i.e. 95°C, 45 sec;
55-58°C, 45 sec; 72°C, 1 min., were used to amplify the BASB034 PCR fragment from
the lysed transformant samples. Following thermal amplification, a ~20 µl aliquot of the
reaction was analyzed by agarose gel electrophoresis (0.8 % agarose in a Tris-acetate-
EDTA (TAE) buffer). DNA fragments were visualized by UV illumination after gel
electrophoresis and ethidium bromide staining. A DNA molecular size standard (1 Kb
ladder, Life Technologies) was electrophoresed in parallel with the test samples and was
used to estimate the size of the PCR products. Transformants that produced the
expected PCR product were identified as strains containing a BASB034 expression
construct. Expression plasmid containing strains were then analyzed for the inducible
expression of recombinant BASB034.

C: Expression Analysis of PCR-Positive Transformants. 

For each PCR-positive transformant identified above, ~5.0 ml of LB broth containing
kanamycin (50 µg/ml) and ampicillin (100 µg/ml) was inoculated with cells from the
patch plate and grown overnight at 37 °C with shaking (~250 rpm). An aliquot of the
overnight seed culture (~1.0 ml) was inoculated into a 125 ml erlenmeyer flask
containing ~25 ml of LB Kn / Ap broth and grown at 37 °C with shaking (~250 rpm)
until the culture turbidity reached O.D.600 of ~0.5, i.e. mid-log phase (usually about 1.5
- 2.0 hours). At this time approximately half of the culture (~12.5 ml) was transferred
to a second 125 ml flask and expression of recombinant BASB034 protein induced by
the addition of IPTG (1.0 M stock prepared in sterile water, Sigma) to a final
concentration of 1.0 mM. Incubation of both the IPTG-induced and non-induced
cultures continued for an additional ~4 hours at 37 °C with shaking. Samples (~1.0 ml)
of both induced and non-induced cultures were removed after the induction period and
the cells collected by centrifugation in a microcentrifuge at room temperature for ~3
minutes. Individual cell pellets were suspended in ~50 µl of sterile water, then mixed
with an equal volume of 2X Laemmli SDS-PAGE sample buffer containing 2-
mercaptoethanol, and placed in a boiling water bath for ~3 min to denature protein. Equal
volumes (~15 µl) of both the crude IPTG-induced and the non-induced cell lysates were
loaded onto duplicate 12% Tris/glycine polyacrylamide gels (1 mm thick Mini-gels,
Novex). The induced and non-induced lysate samples were electrophoresed together
with prestained molecular weight markers (SeeBlue, Novex) under conventional
conditions using a standard SDS/Tris/glycine running buffer (BioRad). Following
electrophoresis, one gel was stained with Coomassie brilliant blue R250 (BioRad) and
then destained to visualize novel BASB034 IPTG-inducible protein(s). The second gel
was electroblotted onto a PVDF membrane (0.45 micron pore size, Novex) for ~2 hrs at
4 °C using a BioRad Mini-Protean II blotting apparatus and Towbin's methanol (20 %)
transfer buffer. Blocking of the membrane and antibody incubations were performed
according to methods well known in the art. A monoclonal anti-RGS (His)3 antibody,
followed by a second rabbit anti-mouse antibody conjugated to HRP (QiaGen), was
used to confirm the expression and identity of the BASB034 recombinant protein.
Visualization of the anti-His antibody reactive pattern was achieved using either an
ABT insoluble substrate or using Hyperfilm with the Amersham ECL
chemiluminescence system.
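
The IPTG addition in the induction step above is a simple dilution, and the volume of stock to pipette can be estimated as sketched here. The function name and parameters are assumptions for illustration. The calculation also assumes that the small volume of stock added does not meaningfully change the culture volume, which holds for a 1000-fold dilution.

```python
def stock_volume_ul(stock_molar: float, final_molar: float, culture_volume_ml: float) -> float:
    """Volume of stock solution (in microlitres) needed to reach the final concentration.

    Uses C1 * V1 = C2 * V2 and neglects the volume contributed by the stock itself.
    """
    if stock_molar <= 0 or final_molar < 0 or culture_volume_ml < 0:
        raise ValueError("concentrations and volume must be non-negative, stock > 0")
    if final_molar > stock_molar:
        raise ValueError("final concentration cannot exceed the stock concentration")
    return final_molar / stock_molar * culture_volume_ml * 1000.0


if __name__ == "__main__":
    # Values from the text: 1.0 M stock, 1.0 mM final, about 12.5 ml of culture.
    print(round(stock_volume_ul(1.0, 1.0e-3, 12.5), 2))
```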

D: Sequence Confirmation.

To further verify that the IPTG-inducible recombinant BASB034 protein being
expressed is in the correct open reading frame and not a spurious molecule arising from
a cloning artifact (i.e. a frame-shift), the DNA sequence of the cloned insert was
determined. The DNA sequence for the M. catarrhalis BASB034 gene was obtained
from one strand using conventional asymmetric PCR cycle sequencing methodologies
(ABI Prism Dye-Terminator Cycle Sequencing, Perkin-Elmer). Sequencing reactions
were programmed with undigested expression plasmid DNA (~0.5 µg/rxn) as a template
and appropriate pQE30 vector-specific and ORF-specific sequencing primers (~3.5
pmol/rxn). In addition to the template and sequencing primer, each sequencing reaction
(~20 µl) contained the four different dNTPs (i.e. A, G, C, and T) and the four
corresponding ddNTPs (i.e. ddA, ddG, ddC, and ddT) terminator nucleotides, with each
terminator being conjugated to one of the four fluorescent dyes, Joe, Tam, Rox, or Fam.
Single strand sequencing elongation products were terminated at random positions
along the template by the incorporation of the dye-labelled ddNTP terminators.
Fluorescent dye-labelled termination products were purified using microcentrifuge size-
exclusion chromatography columns (Princeton Genetics), dried under vacuum,
suspended in a Template Resuspension Buffer (Perkin-Elmer) for capillary
electrophoresis or deionized formamide for PAGE, denatured at 95°C for ~5 min, and
analyzed by high resolution capillary electrophoresis (ABI 310 Automated DNA
Sequenator, Perkin-Elmer) or high resolution PAGE (ABI 377 Automated DNA
Sequenator) as recommended by the manufacturer. DNA sequence data produced from
individual reactions were collected and the relative fluorescent peak intensities analyzed
automatically on a PowerMAC computer using ABI Sequence Analysis Software
(Perkin-Elmer). Individually autoanalyzed DNA sequences were edited manually for
accuracy before being merged into a consensus single strand sequence "string" using
AutoAssembler software (Perkin-Elmer). Sequencing determined that the expression
plasmid contained the correct sequence in the correct open reading frame.

Example 4: Production of Recombinant BASB034

Bacterial strain 

A recombinant expression strain of E. coli M15(pREP4) containing a plasmid (pQE30)
encoding BASB034 from M. catarrhalis was used to produce cell mass for purification
of recombinant protein. The expression strain was cultivated on LB agar plates
containing 50 µg/ml kanamycin ("Kn") and 100 µg/ml ampicillin ("Ap") to ensure that
the pREP4 lacIq control plasmid and the pQE30-BASB034 expression construct were
both maintained. For cryopreservation at -80 °C, the strain was propagated in LB broth
containing the same concentration of antibiotics then mixed with an equal volume of LB
broth containing 30% (w/v) glycerol.

Media 

The fermentation medium used for the production of recombinant protein consisted of
2X YT broth (Difco) containing 50 µg/ml Kn and 100 µg/ml Ap. Antifoam was added to
medium for the fermentor at 0.25 ml/L (Antifoam 204, Sigma). To induce expression of
the BASB034 recombinant protein, IPTG (Isopropyl β-D-Thiogalactopyranoside) was
added to the fermentor (1 mM, final).

Fermentation

A 500-ml erlenmeyer seed flask, containing 50 ml working volume, was inoculated with
0.3 ml of rapidly thawed frozen culture, or several colonies from a selective agar plate
culture, and incubated for approximately 12 hours at 37 ± 1 °C on a shaking platform at
150 rpm (Innova 2100, New Brunswick Scientific). This seed culture was then used to
inoculate a 5-L working volume fermentor containing 2X YT broth and both Kn and Ap
antibiotics. The fermentor (Bioflo 3000, New Brunswick Scientific) was operated at 37
± 1 °C, 0.2 - 0.4 VVM air sparge, 250 rpm with Rushton impellers. The pH was not
controlled in either the flask seed culture or the fermentor. During fermentation, the pH
ranged from 6.5 to 7.3 in the fermentor. IPTG (1.0 M stock, prepared in sterile water) was
added to the fermentor when the culture reached mid-log of growth (~0.7 O.D.600
units). Cells were induced for 2 - 4 hours then harvested by centrifugation using either a
28RS Heraeus (Sepatech) or RC5C superspeed centrifuge (Sorvall Instruments). Cell
paste was stored at -20 °C until processed.

Purification 

Chemicals and Materials 

Imidazole, guanidine hydrochloride, Tris (hydroxymethyl), and EDTA (ethylene-
diamine tetraacetic acid) biotechnology grade or better were all obtained from
Ameresco Chemical, Solon, Ohio. Triton X-100 (t-Octylphenoxypolyethoxy-ethanol),
sodium phosphate, monobasic, and Urea were reagent grade or better and obtained from
Sigma Chemical Company, St. Louis, Missouri. Glacial acetic acid and hydrochloric
acid were obtained from Mallincrodt Baker Inc., Phillipsburg, New Jersey. Methanol
was obtained from Fisher Scientific, Fairlawn, New Jersey. Pefabloc® SC (4-(2-
Aminoethyl)-benzenesulfonylfluoride), Complete protease inhibitor cocktail tablets, and
PMSF (phenylmethyl-sulfonylfluoride) were obtained from Roche Diagnostics
Corporation, Indianapolis, Indiana. Bestatin, Pepstatin A, and E-64 protease inhibitor
were obtained from Calbiochem, LaJolla, California. Dulbecco's Phosphate Buffered
Saline (1x PBS) was obtained from Quality Biological, Inc., Gaithersburg, Maryland.
Dulbecco's Phosphate Buffered Saline (10x PBS) was obtained from BioWhittaker,
Walkersville, Maryland. Penta-His Antibody, BSA free was obtained from QiaGen,
Valencia, California. Peroxidase-conjugated AffiniPure Goat Anti-mouse IgG was
obtained from Jackson Immuno Research, West Grove, Penn. AEC single solution was
obtained from Zymed, South San Francisco, California. All other chemicals were
reagent grade or better.

Ni-NTA Superflow resin was obtained from QiaGen Inc., Valencia, California. Precast 
Tris-Glycine 4-20% and 10-20% polyacrylamide gels, all running buffers and solutions, 
SeeBlue Pre-Stained Standards, MultiMark Multi-Colored Standards and PVDF transfer 
membranes were obtained from Novex, San Diego, California. SDS-PAGE Silver Stain 
kits were obtained from Daiichi Pure Chemicals Company Limited, Tokyo, Japan. 
Coomassie Stain Solution was obtained from Bio-Rad Laboratories, Hercules, 
California. Acrodisc® PF 0.2 µm syringe filters were obtained from Pall Gelman
Sciences, Ann Arbor, Michigan. GD/X 25 mm disposable syringe filters were obtained
from Whatman Inc., Clifton, New Jersey. Dialysis tubing 8,000 MWCO was obtained
from BioDesign Inc. of New York, Carmel, New York. BCA Protein Assay Reagents
and Snake Skin dialysis tubing 3,500 MWCO were obtained from Pierce Chemical Co., 
Rockford, Illinois. 

Extraction Protocol 

Cell paste was thawed at room temperature for 30 to 60 minutes. Five to six grams of
material was weighed out into a 50 ml disposable centrifuge tube. To this, five
mls/gram of Guanidine hydrochloride (Gu-HCl) buffer was added (6 M Guanidine
hydrochloride, 100 mM Sodium phosphate, monobasic, 10 mM Tris and 0.05 % Triton
X-100, pH 8.0). Cell paste was resuspended using a PRO300D PRO Scientific
homogenizer, at 3/4 power for one minute. The extraction mixture was then placed at
room temperature with gentle agitation for 60 to 90 minutes. After 60 to 90 minutes the
extraction mixture was centrifuged at 15,800 x g for 15 minutes (Sorvall RC5C
centrifuge, 11,500 rpm). The supernatant (S1) was decanted and saved for additional
purification. The pellet (P1) was saved for analysis.

Binding of BASB034 to Nickel-NTA Resin

To the S1, three to four mls of Ni-NTA resin is added. This is then placed at room
temperature with gentle agitation for one hour. After one hour the S1/Ni-NTA is
packed into an XK16 Pharmacia column. The column is then washed with 1 M Gu-HCl
buffer (1 M Guanidine hydrochloride, 100 mM Sodium phosphate, monobasic, 10 mM
Tris and 0.05% Triton X-100, pH 8.0). This is then followed by a wash with phosphate
buffer (100 mM Sodium phosphate, monobasic, 10 mM Tris and 0.05% Triton X-100,
pH 6.3). The protein is then eluted from the column with a 250 mM imidazole buffer
(250 mM imidazole, 100 mM Sodium phosphate, monobasic, 10 mM Tris and 0.05%
Triton X-100, pH 5.9).

Final Formulation 

BASB034 was formulated by dialysis overnight against three changes of 0.1 % Triton
X-100 and 1x PBS, pH 7.4, to remove residual Gu-HCl and imidazole. The purified
protein was characterized and used to produce antibodies as described below.

Biochemical Characterizations 
SDS-PAGE and Western Blot Analysis 

The recombinant purified protein was resolved on 4-20 % polyacrylamide gels and
electrophoretically transferred to PVDF membranes at 100 V for 1 hour as previously
described (Towbin et al. 1979, Proc. Natl. Acad. Sci. USA 76:4350-4354). The
PVDF membranes were then pretreated with 25 ml of Dulbecco's phosphate buffered
saline containing 5 % non-fat dry milk. All subsequent incubations were carried out
using this pretreatment buffer.

PVDF membranes were incubated with 25 ml of a 1:500 dilution of preimmune serum
or rabbit anti-His immune serum for 1 hour at room temperature. PVDF membranes
were then washed twice with wash buffer (20 mM Tris buffer, pH 7.5, containing 150
mM sodium chloride and 0.05 % Tween-20). PVDF membranes were incubated with
25 ml of a 1:5000 dilution of peroxidase-labeled goat anti-rabbit IgG (Jackson
ImmunoResearch Laboratories, West Grove, PA) for 30 minutes at room temperature.
PVDF membranes were then washed 4 times with wash buffer, and were developed
with 3-amino-9-ethylcarbazole and urea peroxide as supplied by Zymed (San Francisco,
CA) for 10 minutes each.

The results of an SDS-PAGE (Figure 4) show a protein about 60 kDa that is reactive to 
an anti-RGS(His) antibody by western blots (Figure 5) of the SDS-PAGE. 

Protein Sequencing 

Amino terminal amino acid sequencing of the purified protein was performed to
confirm the production of the correct recombinant protein using well defined chemical
protocols on a Hewlett-Packard model G1000A sequencer with a model 1090 LC and a
Hewlett-Packard model 241 sequencer with a model 1100 LC.

Example 5: Production of Antisera to Recombinant BASB034

Polyvalent antisera directed against the BASB034 protein were generated by
vaccinating two rabbits with the purified recombinant BASB034 protein. Each animal
was given a total of three immunizations intramuscularly (i.m.) of about 20 µg BASB034
protein per injection (beginning with complete Freund's adjuvant and followed with
incomplete Freund's adjuvant) at approximately 21 day intervals. Animals were bled
prior to the first immunization ("pre-bleed") and on days 35 and 57.

Anti-BASB034 protein titres were measured by an ELISA using purified recombinant
BASB034 protein (0.5 µg/well). The titre is defined as the highest dilution giving a
value equal to or greater than 0.1, as calculated with the following equation: average OD
of two test samples of antisera minus the average OD of two test samples of buffer. The
titres after three immunizations were around 1,000,000.
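
The titre definition above can be expressed as a short calculation. The sketch below is only an illustration under stated assumptions: the function name, the data layout (reciprocal dilution mapped to duplicate OD readings) and every number in the example are invented, and none of them are measurements from this study.

```python
def endpoint_titre(od_by_dilution, buffer_ods, cutoff=0.1):
    """Highest reciprocal dilution whose blank-corrected mean OD is >= cutoff.

    od_by_dilution -- dict mapping reciprocal dilution (e.g. 1600 for 1:1600)
                      to the duplicate OD readings of the antiserum at that dilution
    buffer_ods     -- duplicate OD readings of the buffer-only wells
    Returns None when no dilution reaches the cutoff.
    """
    blank = sum(buffer_ods) / len(buffer_ods)
    positive = [
        dilution
        for dilution, readings in od_by_dilution.items()
        if sum(readings) / len(readings) - blank >= cutoff
    ]
    return max(positive) if positive else None


# Invented readings for a hypothetical serum; not data from the study.
example_readings = {
    100: (1.80, 1.90),
    400: (1.10, 1.20),
    1600: (0.50, 0.54),
    6400: (0.20, 0.22),
    25600: (0.07, 0.08),
}
example_buffer = (0.05, 0.05)

if __name__ == "__main__":
    print(endpoint_titre(example_readings, example_buffer))
```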

The antisera were used as the first antibody to identify the protein in a western blot as 
described in example 4 above. The western-blot shows the presence of anti-BASB034 
antibody in the sera of immunized animals (Figure 6). 



Example 6: Analysis of the non-coding flanking regions of the BASB034 gene,
and its exploitation for modulated BASB034 gene expression. 

The non-coding flanking regions of the BASB034 gene contain regulatory elements 
important in the expression of the gene. This regulation takes place at both the 
transcriptional and translational levels. The sequence of these regions, either upstream or 
downstream of the open reading frame of the gene, can be obtained by DNA 
sequencing. This sequence information allows the determination of potential regulatory 
motifs such as the different promoter elements, terminator sequences, inducible 
sequence elements, repressors, elements responsible for phase variation, the 
Shine-Dalgarno sequence, regions with potential secondary structure involved in 
regulation, as well as other types of regulatory motifs or sequences. 
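
As a purely illustrative sketch of how one such motif could be located in an upstream sequence, the Python fragment below scans for short Shine-Dalgarno-like cores a few bases before the start codon. The motif set (four-base windows of the E. coli-style consensus AGGAGG), the spacing limits, the function name and the test sequence are all assumptions made for the example; they are not taken from the BASB034 upstream region or from the method used here.

```python
import re

# Assumption: four-base windows of the consensus AGGAGG stand in for a
# Shine-Dalgarno-like core. The lookahead lets overlapping hits be reported.
SD_CORE = re.compile(r"(?=(AGGA|GGAG|GAGG))")


def find_sd_candidates(upstream, min_gap=4, max_gap=12):
    """Return (start, motif, gap) tuples for candidate motifs.

    `upstream` is the sequence immediately 5' of the start codon (start codon
    excluded). `gap` is the number of bases between the 3' end of the motif
    and the end of `upstream`; only hits with min_gap <= gap <= max_gap are kept.
    """
    seq = upstream.upper()
    hits = []
    for match in SD_CORE.finditer(seq):
        motif = match.group(1)
        start = match.start()
        gap = len(seq) - (start + len(motif))
        if min_gap <= gap <= max_gap:
            hits.append((start, motif, gap))
    return hits


if __name__ == "__main__":
    # Hypothetical upstream fragment, for illustration only.
    print(find_sd_candidates("TTTTAAGGAGTTATAAC"))
```

A scan of this kind only proposes candidates; whether a hit actually acts as a ribosome binding site has to be confirmed experimentally, for example by the mutagenesis approaches described in this Example.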

This sequence information allows the modulation of the natural expression of the BASB034 
gene. Upregulation of gene expression may be accomplished by altering the 
promoter, the Shine-Dalgarno sequence, potential repressor or operator elements, or any 
other elements involved. Likewise, downregulation of expression can be achieved by similar 
types of modifications. Alternatively, by changing phase variation sequences, the expression 
of the gene can be put under phase variation control, or may be uncoupled from this 
regulation. In another approach, the expression of the gene can be put under the control of 
one or more inducible elements allowing regulated expression. Examples of such regulation 
include, but are not limited to, induction by temperature shift or by addition of inducer 
substrates such as selected carbohydrates or their derivatives, trace elements, vitamins, 
co-factors, metal ions, etc. 

Such modifications as described above can be introduced by several different means. 
The modification of sequences involved in gene expression can be done in vivo by 
random mutagenesis followed by selection for the desired phenotype. Another approach 
consists in isolating the region of interest and modifying it by random mutagenesis, or 
site-directed mutagenesis, insertion or deletion mutagenesis. The modified region can 
then be reintroduced into the bacterial genome by homologous recombination, and the 
effect on gene expression can be assessed. In another approach, the sequence knowledge 
of the region of interest can be used to replace or delete all or part of the natural 
regulatory sequences. In this case, the regulatory region targeted is isolated and 
modified so as to contain the regulatory elements from another gene, a combination of 
regulatory elements from different genes, a synthetic regulatory region, or any other 
regulatory region, or to delete selected parts of the wild-type regulatory sequences. 
These modified sequences can then be reintroduced into the bacterium via homologous 
recombination into the genome. A non-exhaustive list of preferred promoters that could 
be used for up-regulation of gene expression includes the promoters porA, porB, lbpB, 
tbpB, p110, lst, hpuAB from N. meningitidis or N. gonorrhoeae. 

In one example, the expression of the gene can be modulated by exchanging its 
promoter with a stronger promoter (through isolating the upstream sequence of the 
gene, in vitro modification of this sequence, and reintroduction into the genome by 
homologous recombination). Upregulated expression can be obtained in both the 
bacterium as well as in the outer membrane vesicles shed (or made) from the bacterium. 

In other examples, the described approaches can be used to generate recombinant bacterial 
strains with improved characteristics for vaccine applications. These can be, but are not 
limited to, attenuated strains, strains with increased expression of selected antigens, strains 
with knock-outs (or decreased expression) of genes interfering with the immune response, 
strains with modulated expression of immunodominant proteins, strains with modulated 
shedding of outer-membrane vesicles. 

A region directly upstream of the BASB034 gene is given in the sequence of SEQ ID 
NO: 13. This sequence is a further aspect of the invention. 

SEQUENCE INFORMATION 

BASB034 Polynucleotide and Polypeptide Sequences 

SEQ ID NO:1 

Moraxella catarrhalis BASB034 polynucleotide sequence from strain ATCC 43617 

ATGAAAGTTTCACTGTCTA»TTGACTTTATCTATTTTGT 

ACCAAATCCTGTGGCATTTGTTGACGAACTACGCAGTGA 

AAAGTGCGACAQVATCAGCGTCTACTGATACGGCTAATCCTTTAGACGA^ 
10 GAAAATAAAACCATGCTGATTAACTGCTCAGCACTTAATCAAGATATCATGCGTTTTC 

TGGTGAG ACG C CAGCGGTAATTAAAAC CAAG CGTT C CATT CG CCTTGATGAAACAATTTGGCAGAC CATCAAAGG CAAAC 

CCCAGGTTATCTATCAAGAAACGACAGATCCGA rrrTTlTAATGGGTAATGAAAAATCCATGCTGACCAAAAAAGATGCC 

AAACAG CTTGAATATG GAG CCAAACAG TTTACACCACTG AG CTTAT CATTTGATTTAG AC CG AAAT AAT ACAC CACTTTG 

GT CAT CACGAC CACACAATC CGATGTATGTATTGC CCATATTTATG CACGGTAAG C CTAAT CGAAG CC CAAATACG C C CA 
1 5 GTCATGAAGCAAAAGAATTTACCCCAAATGAATTTCGTG 

G CTGAGGATTTATGGGGG ACGGATT CAGATTTATGGTTTGGAT ATACACAGCAATCG CACTG^CAGATTTTTAATGG AAA 

AAACTCTCGTCCTTTTAGAGTACATGACTACCAGCCAGAGATTTTCTT 

ATGGCAAAGTCCGCATGATTGGCATGGGTGCGGTACATCATTCCAATGGTGAAAGTGCC 

CGTG CTTATTTGATGG CAGGCATGG AATGG AAAAACCTGACTGTCATGC CACG CATTTGGGGG CTGTATCTTTAAAGAGGG 
20 TAGTGGCAGCCAGCCAGATGATAATCCTGATATCTTGGACTATTATGGCT 
AAAATAAAAGTAATATTTCAGGTACGGTACGCTATAATC 

C CG CTTGGTAAGGGAATTAGTGG CrATTTTCAAATATTT CAAGG CTATGGGCAGTC TT TG ATTGATTATAATCATG AGG C 
G ACAAG CTTTGGCGTCGGACTTATG CTTAACC ACTGG ATGGGTCTATAA 

SEQ ID NO:2 

Moraxella catarrhalis BASB034 polypeptide sequence deduced from the polynucleotide of 
SEQ ID NO:1 

MKVSl^TLTLSILSCFAIlAIQQAQAVPNPVAFVDET^ 
30 ENKTML I NCS ALNQD I MRIiACYDTL VHGET P AVI KTKRS I RLD ET I WQT I KG KPQ V I YQ ETTD P I FLMGN E KGMLTKKD A 
KQLEYAAKQFTPI^LSFDLORNK^PLWSSRPHNP^rm-PIFMHGKPMRSPKTPSHEAKQ^^PNEF 
AEDLWGTBSDLWFGYTQQSHWQIFT?GKNSRPFRVHDYQPE 

RAYLMAGMEWKNLTVMPRIWGRIFKEGSGSQPDDNPDILDYYGYGDVRFLYQLE^ 
PLGKGISGYFQIFQGYGOSLIDYNHEATSFGVGLMLNDWMGL 
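
The relationship between a polynucleotide sequence and the polypeptide sequence deduced from it can be illustrated with a minimal Python sketch. It assumes the standard genetic code and a reading frame starting at the first base; the function and variable names are illustrative only, and the two short test fragments are the first seven codons and the last four codons of the coding sequence as they read in the sequences given here.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string (frame 1) into one-letter amino acids.

    Whitespace is ignored, translation stops at the first stop codon, and
    codons containing unreadable characters are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATG AAA GTT TCA CTG TCT ACA"))  # first seven codons
    print(translate("ATGGGTCTATAA"))                 # last three codons plus stop
```

The lengths are consistent with such a translation: 442 codons plus one stop codon correspond to 442 x 3 + 3 = 1329 nucleotides, which matches the lengths given for the polynucleotide and polypeptide sequences in the sequence listing.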


SEQ ID NO:3 

Moraxella catarrhalis BASB034 polynucleotide sequence from strain Mc2908 

ATG AAAGTTTCACTGTCT ACATTG ACTTTATCrATTTTGCCATGTTTTC CT ATCCT CAAGCAC AAGCTGT 
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AC CAAATCCTGTGG CATTTGTTGACGAAGT ACG CAGTAAAAATCATCTTGGGCAAGACAATGAATTACT CATTGGTGTAC 
AAAGTG CGACACAATCAG CGT CT ACTGATACGG<nAATCCTTTAGACGAACATGAAC CAG AG CTTTATACGACAG CTTTA 
G AAAATAAAACCATGCTG ATTAACTGCTCAGCACTTAATCAAGATATCATG COT TT G CCGTG CTATGACACTTTGGTG CA 
TGGTGAG ACG C CAG CX3GTAATTAAAACCAAG CGTTCCATTCGCCTTGATG AA^ 
5 C C CAGGTTG T CT AT CAAG AAACG ACAG ATCCG ATTTTTTTAATGGGTAATGAAAAAGG CATGCTGACCAAAAAAG ATG C C 
AAACAG CTTG AAT ATG CAG C CAAACAG TTT ACAC CACTGAG CTTATCATTTGATTTAG AC CG AAATAATACACCG CTTTG 
GT CATCACG ACCACACAATCCGATGTATGTATTGCCCATATTTATGCACGGTAAGC 
GTCATGAAGCAAGACAATTTACCCOWUVTGAATTTCGTGCCCCTGAATTAAAATTTCAM 

G CTG AG G ATTT ATGGGGGACGG ATTCAG ATTT ATGGTTTGGG T ATACACAGCAAT CG CACTGG CAG ATTTTTAATGGAAA 
1 0 AAACT CTCGTC CTTTTAGAGT ACATG ATT AC CAG C CAG AG ATTTTCTTAACTCAAC CTGTGT ACT CAG ACTT ACCATGGG 
ATGGCAAAGTC CGCATG ATTGG CATGGGTGCGG TACATCATTC CAATGGTGAAAGTG CCAAACTGT CT CG CT CATGGAAT 
CGTGCTTATTTGATGGCAGGCATGGAATGGAAAAACCTGACT 

TAGTGG CAG C CAG CCAGATGACAAT CCTG ATAT CTTGGACTATT ATGGTTATGGTGATGTG CGTTT TTT ATAT CAACT AG 
AAAATAAAAGTAATATTTCAGGTACGGTACGCTATAATCCACGCT 
1 5 C CG CTTG GTAAGGGAATT AGTGG CTATTTT CAAATATTT CAAGG CT ATGGG CAGTCTTTG ATTGATTAT AATCATGAGGC . 
GACAAGCTTTGG CGTCGGACTTATG CTTAACGACTGGATGGGTCTATAA 



SEQ ID NO:4 

Moraxella catarrhalis BASB034 polypeptide sequence deduced from the polynucleotide of 
SEQ ID NO:3 

MKVSL£TLTI>SILPCFAIIAIQQAQAVPNPVAFVDEVRS 

ENKTML INCS ALNQD IMRIACYDTL VHGETPAVI KTKRS IRLDETI WQTI KGKPQVVYQETTD P I FLMGNEKGMLTKKDA 
KQLEYAAKQFTPLSI^FDLDRNNTPLWSSRPHNPMYVLPIFMHGKP^ 
25 ASDLWGTTJSDLWFGYTQQSHWQIFNGKNSRPFRVHDYQPEIFLTQPVYSDLPWDGKV^ 
RAYLMAGMEWKNLTVMPRIWGRIFKEGSGSQPDDNPDIIJ)YYGYGDVRFL^ 
P LG KG I SGY FQ I FQGYGQS L I D YNHEATS FGVG LMLND WMG L 

SEQ ID NO:5 

Moraxella catarrhalis BASB034 polynucleotide sequence from strain Mc2913 

. ATG AAAGTTT CACTGTCTACATTGACTTTATCTA' I'lT'lti TC AlXy'lT T T GC T A T CCTAG CCATTCAGCAAGCAAAAGCTGT 
. ACCAAATCCTGTGGCATTrGTTGACGAAGTACGCAGTGAAAA 
AAAGTG CGACACAATCAGCGTCTACTGATACGGCTAATCCTTTAGACGAA 
3 5 G AAAATAAAAC CATGCTG ATTAACTG CTCAG CACTTAATCAAGATATCATGCG'l" r rGGC^TG CTATGACACTTTGGTG CA 
TGGTGAGACGCCAGCGGTAATTAAAACCAAGCGTTCCATTCGCCTTGAT^ 

CCCAGGTTGTCTATCAAGAAACGACAGATCCGA 1 1 III' 1" i'AATGGGTAATGAAAAAGG CATC CTGACCAAAAAAG ATG CC 
AAACAGCTTGAATATGCAGCCAAACAGTTTACACCACTGA 

GTCATCACGACCACACAATCCGATGTATGTATTGCCCATATTTATGCACGGTAAGCCT 
40 GTCATGAAGCAAGACAATTTACCCCAJ^TGAATTTCGTGCCCCTGAATT 

G CTG AGGATTTATGGGGGACGGATTCAGATTTATGGT l ' lU SATATACACAGCAATCGCACTGGCAGATT I "l l 'AATGGAAA 
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AAACTCTCGTCCTTTTAGAGTACATGATTACCAGCCAGAGATTT^ 

ATGGCAAAGTCCGCATGATTGGCATGGGTCCGGTACATCATT 

CGTGCTTATTTGATGGCAGGCATGGAATGGAAAA^ 

TAGTGGCAGCCAGCCAGATGACAATCCTGATATCTTGGACTATTATGGTTATC^ 
AAAATAAAAGTAATATTTCAGGTACGGTACGCTATAATCCACGCTCAGGCAAAGG 
CCGCTTGGTAAGGGAATTAGTGGCTATTTTCAAATATTTCAA 
GACAAGCTTTGGCGTCGGACTTATGCTTAACGACTGGATGGGTCTATAA 

SEQ ID NO:6 

Moraxella catarrhalis BASB034 polypeptide sequence deduced from the polynucleotide of 
SEQ ID NO:5 

MKVSLSTLTI^ILSCFAILAIQQAKAVPNPVAFVDE 

ENKTMLINCSALNQDIMRtACYDTLVHGETPAVIKTKRS IRLOETIWQTIKGKPQWYQETTDP IFLMGNEKGMLTKKDA 
KQLEYAAKQFTPLSLSFDLDRNNTPLWSSRPHNPM^ 

MDLWGTOSDLWFG ^^ SH WQIFNGKNSRPFRVHDYQPEIFLTQPVYSDLPWTC 
RAYT^IAGMEWKNLTVMPRIWGRIFKEGSGSQPDDNPDILDYYGYGDVRFLYQL^ 
P LGKG I S G YFQ I FQGYGQS L ID YNHEATS FGVGLMLNDWMGL 

SEQ ID NO:7 

Moraxella catarrhalis BASB034 polynucleotide sequence from strain Mc2969 

JU ATGAAAGTTTCACTGTCTA^TTGACITTA^ 

i^S ACCAAATCCTGTGGCATTTGTTGACGAAGTACGCAGTGAAAATGATCTTC^^ 

U 25 AAAGTGCGACACAATCGGCGTCTACTGATACGGCTAATCCTTTAGACGAACATGAACCAGAGCT 

=P GA A^TAAAACCATGCTGATTAACIX3CTCAG<^OT 

O TGGTGAGACGCCAGCGGTAATTAAAACCAAGCGTTCCATTCGCCTTGATGAAACA^ 

^ CCCAGGTTGTCTATCAAGAAACGACAGATCCGATTTTTTTAATC^^ 

AAACAGCTTGAAT ATGCAG C CAAACAGTTTACACCACTGAG CTTATCATTTGATTTAGACCG AAATAATACACCACTTTG 
30 GT ^TCACG ACCACACAAT CCGATGTATGTATTGCCCATA CG AAG CC CAAATACG CCCA 
GTCATGAAGCAAAACAATTTACCCCAAATGAATTTCGTGCTCCCGAGCTA 
GCTGAGGATTTATGGGGGACGGATTCAGATTTATGGTTTGGATATACACAGCAATCGCACT 
AAACTCTCGTCCTTTTAGAGTACATGACTACCAGCCAGAGATTTTCTTAACTCAACCT 
ATCGCAAAGTCCCCATGATTGGCATGGGTGCGGTACATCATTCCAATGGTGAAAGTGCCA 

CGTGCTTATTTG ATGGCAGGCATGG AATGGAAAAACCTG ACTGTCATGCCACG CATTTGGGGG CGT ATCTTTAAAG AGCG 
T AGTGG CAG CCAG C CAG ATG ATAATCCTG AT AT CTTGG ACTATT ATGG TT ATGGTG ATGT G CG TTTTTT AT AT CAACT AG 
AAAAT AAAAGT AAT ATTT CAGGT ACGG T ACG CT AT AAT C CACG CTCAGG CAAAGG TG CG TTG CAACTTG ACT ATG T CT AT 
CCG CTTGG T AAGGG AATT AG TG G CT ATTTT CAAATATTT CAAGG CT ATGGG CAGT CTTTG AT TG ATT AT AAT CATG AGG C 
GACAAGCTTTGGCGTCGGACTTATGCTTAACGACTGGATGGGTCTATAA 



SEQ ID NO:8 

Moraxella catarrhalis BASB034 polypeptide sequence deduced from the polynucleotide of 
SEQ ID NO:7 

MKVSt^TLTLSILPCFAILAIQQAQAVPMPVAFVOBVRSENDLGQDNELPIDVQSATQSASTDTAIIPLDEHEPELVTTAL 
ENKTML INCS ALNQD IMRLACYDTLVHGETPAVI KTKRS I RLDET I WQT I KGKPQ WYQETTD P I FLMGNE KGMLTKKDA 

KQLEYAAKQFTPI^LSFDLDt^PLWSSRPHNPmfVLPIFMHGKPNRSPtn'PSHEAKQFTPNEFRAPELKFQVSV^ 
AEDLWGTOSDLWPGVTQQSHWQIFNGlWSRPraTODYQPEIFXTQPVYSDLPWDGKVRMrGMGAVHHSNGESAKLSRSWN 
10 RAYLMAGMEWKTn<TVMPRIWGRIFKEGSGSQPDDNPDILOYYGYGDVRFLTfQLEI/KSNISGTVRYNPRSGKGALQLOYVY 
PLGKGISGYFQIFQGYGQSLIDYNHBATSFGVGLMLNDWMGL 






SEQ ID NO:9 

GAT TTA AGA GTA TGT TAT GAT G 

SEQ ID NO:10 

GTA TGG GTT GAT CAA ATA CAG 

SEQ ID NO:11 

AAG GGC CCA ATT ACG CAG AGG GGA TCC CAA GCT GTA CCA AAT CCT 
GTG GCA TTT GTT G 

SEQ ID NO:12 

AAG GGC CCA ATT ACG CAG AGG GTC GAC TTA TTA TAG ACC CAT CCA 
GTC GTT AAG CAT AAG 

SEQ ID NO: 13 



ACTTGGCGAAAATACCATTTATATCGATTGTGATGWATACAGGCAGATGGCGGT 
CTGCGGTGGCACTTATTGATGCTTTAGAACACTTGCAGCGTCGTAAAAAGCTTO 
GCAGCGGTTTCTGTGGGTGTTAATCAAGGCOGTGTATTGCTTGATTTGGACT 
. " TTTAAATGTGGTCAICAWCAGGCAGGTGGGTTTATTGAGATTCAAGGCACAGCAGAAGAAAAGCCATTTA 
AAGCTAATCCGATGCTTGATTTGGCAGAGCTGGGAATTGGGCAGATTATCGAAGCCCAAAAGCAAGTATT^ 
TATCCTAATCGTTGAAGATAATGGCGTGATCATC^ 
35 TATTGCTGCTGCTGGGTGTCTTGGTGGCAATCATTTCTTTGATTGCACCCGTTTTTTATGCAA 

TTTGCAGTTGTGGTAr 1 I lj I ^'I'l'i'AATATTCAAAGGCAARAAGCCAAAACTTCTCATATGTTTTCACAAGGTCGCTTGAA 
GATTACGTCCAAACGCTTTGAGATTCATAACAAATCACTAACCTTATCAGCATCGGCAACAATATCTGCTAAAGATAAC^ 
AAATGACAATTGTTGATCGGGGCATTGAATATCATrTTACAGGTTTTGCTGAT^ 



GTACTTTTGCraAAAGTCAATCAAAACCAATGCGGTGGCGGTAACATTGGCTAACT 

ATGGTCTTTAACTCCACCCACCTAACI r 1 1 1 C rTTGTTTGGATTTAAGAGT ATGTTATG ATGGG CAGGATTTTATTTTAA 
GTCATCATTTAATGCAATCAGTTGTCCAGAGTAGCCGTTC 

Deposited materials 

A deposit containing a Moraxella catarrhalis Catlin strain has been deposited with the American 
Type Culture Collection (herein "ATCC") on June 21, 1997 and assigned deposit number 43617. 
The deposit was described as Branhamella catarrhalis (Frosch and Kolle) and is a freeze-dried, 
1.5-2.9 kb insert library constructed from an M. catarrhalis isolate obtained from a transtracheal 
aspirate of a coal miner with chronic bronchitis. The deposit is described in Antimicrob. Agents 
Chemother. 21:506-508 (1982). 

The Moraxella catarrhalis strain deposit is referred to herein as "the deposited strain" or as "the 
DNA of the deposited strain." 

The deposited strain contains a full length BASB034 gene. 

A deposit of the vector pMC-PLA1 consisting of Moraxella catarrhalis DNA inserted in pQE30 has 
been deposited with the American Type Culture Collection (ATCC) on February 12, 1999 and 
assigned deposit number 207099. 

The sequence of the polynucleotides contained in the deposited strain, or in the deposited clone, as 
well as the amino acid sequence of any polypeptide encoded thereby, are controlling in the event of 
any conflict with any description of sequences herein. 

The deposits of the deposited strain/clone have been made under the terms of the Budapest Treaty 
on the International Recognition of the Deposit of Micro-organisms for Purposes of Patent 
Procedure. The deposited strains will be irrevocably and without restriction or condition released to 
the public upon the issuance of a patent. The deposited strains are provided merely as a convenience 
to those of skill in the art and are not an admission that a deposit is required for enablement, such as 
that required under 35 U.S.C. §112. 

SEQUENCE LISTING 
<110> SmithKline Beecham Biologicals S.A. 
<120> Novel Compounds 

<130> BM4S332 
<160> 13 

<170> FastSEQ for Windows Version 3.0 



<210> 1 
<211> 1329 
<212> DNA 

<213> Moraxella catarrhalis 

<400> 1 

aegaaagtct cactgtctac aetgacteta tctattttgt catgttcegc taecctagcc 60 
actcagcaag cacaagctgt accaaatcct gtggcaeccg ttgacgaagt acgcagtgaa 120 
aatgatctcg ggcaagacaa cgaaccaccc attgatgtcc aaagtgcgac acaatcagcg 180 
tccactgata cggctaatcc tttagacgaa catgaaccag agctttatac gacagcttta 240 
gaaaataaaa ccatgctgac taactgctca gcacteaaee aagacaccac gcgtccggcg 300 
tgccatgaca ctctggegca tggtgagacg ccagcggtaa ccaaaaccaa gcgttccate 360 
cgccttgatg aaacaaeetg gcagaccatc aaaggcaaac cceaggccac ctatcaagaa 420 
acgacagatc cgattttttt aatgggtaat gaaaaaggca tgcegaccaa aaaagacgcc 480 
aaacagcetg aacacgcagc caaacagtet acaccactga gcttatcatc tgacccagac 540 
cgaaataata caccactttg gccatcacga ccaeacaatc cgatgtatgt aeegcccata 600 
tttacgcacg gtaagcctaa tcgaagccca aacacgccca gccatgaagc aaaacaactc 660 
accccaaacg aacctcgcgc tcccgagcca aaattccagg cccccgccaa ggtcaaagcc 720 
gctgaggate tatgggggac ggattcagac ctacggcttg gatacacaca gcaategcac 780 
cggcagatcc tcaatggaaa aaactcecgc ccttccagag eacatgacca ccagccagag 840 
attcccctaa cccaacccgt acactcagac CCaccacggg acggcaaagc ccgcatgacc 900 
ggcatgggcg cggcacacca tcccaaeggt gaaagtgcca aactgtcccg cccacggaac 960 
cgtgcccaee cgacggcagg cacggaaegg aaaaacccga ccgccacgcc acgcactcgg 1020 
gggcgcatcc ccaaagaggg tagcggcagc cagccagacg acaaccccga cacctcggac 1080 
cactatggtt acggcgacgt gcgcccccca caccaaccag aaaacaaaag caacacccca 1140 
ggcacggcac gctacaaccc acgctcaggc aaaggtgcgt tgcaacttga ctacgtctac 1200 
ccgcttggta agggaattag tggctatttt caaatatttc aaggctatgg gcagtctttg 1260 
attgattaca atcatgaggc gacaagcttt ggcgccggac ttatgcttaa cgactggatg 1320 
ggeccataa 1329 

<210> 2 
<211> 442 
<212> PRT 

<213> Moraxella catarrhalis 
<400> 2 

Met Lys Val Ser Leu Ser Thr Leu Thr Leu Ser Ile Leu Ser Cys Phe 
1 5 10 15 

Ala Ile Leu Ala Ile Gln Gln Ala Gln Ala Val Pro Asn Pro Val Ala 
20 25 30 

Phe Val Asp Glu Val Arg Ser Glu Asn Asp Leu Gly Gln Asp Asn Glu 
35 40 45 

Leu Pro Ile Asp Val Gln Ser Ala Thr Gln Ser Ala Ser Thr Asp Thr 
50 55 60 

Ala Asn Pro Leu Asp Glu His Glu Pro Glu Leu Tyr Thr Thr Ala Leu 
65 70 75 80 

Glu Asn Lys Thr Met Leu Ile Asn Cys Ser Ala Leu Asn Gln Asp Ile 
85 90 95 

Met Arg Leu Ala Cys Tyr Asp Thr Leu Val His Gly Glu Thr Pro Ala 
100 105 110 

Val Ile Lys Thr Lys Arg Ser Ile Arg Leu Asp Glu Thr Ile Trp Gln 
115 120 125 

Thr Ile Lys Gly Lys Pro Gln Val Ile Tyr Gln Glu Thr Thr Asp Pro 
130 135 140 

Ile Phe Leu Met Gly Asn Glu Lys Gly Met Leu Thr Lys Lys Asp Ala 
145 150 155 160 

Lys Gln Leu Glu Tyr Ala Ala Lys Gln Phe Thr Pro Leu Ser Leu Ser 
165 170 175 

Phe Asp Leu Asp Arg Asn Asn Thr Pro Leu Trp Ser Ser Arg Pro His 
180 185 190 

Asn Pro Met Tyr Val Leu Pro Ile Phe Met His Gly Lys Pro Asn Arg 
195 200 205 

Ser Pro Asn Thr Pro Ser His Glu Ala Lys Gln Phe Thr Pro Asn Glu 
210 215 220 







WO 00/15S02 PCT/EP99/06781 

Phe Arg Ala Pro Glu Leu Lys Phe Gln Val Ser Val Lys Val Lys Ala 
225 230 235 240 

Ala Glu Asp Leu Trp Gly Thr Asp Ser Asp Leu Trp Phe Gly Tyr Thr 
245 250 255 

Gln Gln Ser His Trp Gln Ile Phe Asn Gly Lys Asn Ser Arg Pro Phe 
260 265 270 

Arg Val His Asp Tyr Gln Pro Glu Ile Phe Leu Thr Gln Pro Val Tyr 
275 280 285 

Ser Asp Leu Pro Trp Asp Gly Lys Val Arg Met Ile Gly Met Gly Ala 
290 295 300 

Val His His Ser Asn Gly Glu Ser Ala Lys Leu Ser Arg Ser Trp Asn 
305 310 315 320 

Arg Ala Tyr Leu Met Ala Gly Met Glu Trp Lys Asn Leu Thr Val Met 
325 330 335 

Pro Arg Ile Trp Gly Arg Ile Phe Lys Glu Gly Ser Gly Ser Gln Pro 
340 345 350 

Asp Asp Asn Pro Asp Ile Leu Asp Tyr Tyr Gly Tyr Gly Asp Val Arg 
355 360 365 

Phe Leu Tyr Gln Leu Glu Asn Lys Ser Asn Ile Ser Gly Thr Val Arg 
370 375 380 

Tyr Asn Pro Arg Ser Gly Lys Gly Ala Leu Gln Leu Asp Tyr Val Tyr 
385 390 395 400 

Pro Leu Gly Lys Gly Ile Ser Gly Tyr Phe Gln Ile Phe Gln Gly Tyr 
405 410 415 

Gly Gln Ser Leu Ile Asp Tyr Asn His Glu Ala Thr Ser Phe Gly Val 
420 425 430 

Gly Leu Met Leu Asn Asp Trp Met Gly Leu 
435 440 

<210> 3 
<211> 1329 
<212> DNA 

<213> Moraxella catarrhalis 
<400> 3 

acgaaagttt cactgtccac attgacctta tctactttgc catgttttgc tatcctagcc 60 
attcagcaag cacaagctgt accaaatccc gcggcacttg ttgacgaagt acgcagtaaa 120 
aatgatcttg ggcaagacaa tgaattactc attggtgtac aaagtgcgac acaatcagcg 180 
cctactgata cggctaatcc ttcagacgaa cacgaaccag agctttatac gacagctcta 240 
gaaaataaaa ccatgctgat taactgctca gcacctaacc aagacatcac gcgcttggcg 300 
cgctatgaca ceetggcgca tggtgagacg ccagcggtaa Ccaaaaccaa gcgttccact 360 
cgcctcgatg aaacaacctg gcagaccacc aaaggcaaac cccaggctgc ctatcaagaa 420 
acgacagacc cgatttcttc aatgggtaat gaaaaaggca tgctgaccaa aaaagatgcc 480 
aaacagcetg aatatgcagc caaacagttt acaccactga gcttatcatt tgacttagac 540 
cgaaacaaca caccgccttg gtcatcacga ccacacaacc cgatgtatgc attgcccaca 600 
cteatgcacg gcaagcceaa Ccgaagccca aatacgccca gtcacgaagc aagacaactc 660 
accccaaatg aatttcgtgc ccctgaacta aaactccaag tctctgttaa ggttaaagcc 720 
gccgaggatt tacgggggac ggattcagat teaeggtttg ggtatacaca gcaaccgcac 780 
tggcagaccc ttaatggaaa aaactctcgt ccecttagag tacatgatta ccagccagag 840 
atttccccaa cccaacctgc gtactcagac etaccatggg atggcaaagt ccgcacgact 900 
ggcacgggcg cggtacatca ttccaatggt gaaagtgcca aactgtcccg ctcatggaat 960 
cgtgctcact tgacggcagg catggaacgg aaaaacctga ctgtcatgcc acgcactcgg 1020 
gggcgcatcc ttaaagaggg tageggcagc cagccagatg acaaccccga taccctggac 1080 
caccacggtc atggtgatgt gcgctttcta tatcaaceag aaaataaaag taatatetca 1140 
ggcacggtac gctataatcc acgctcaggc aaaggcgcgt tgcaacttga ctacgtccac 1200 
ccgcctggta agggaattag tggctacttc caaacatttc aaggctatgg gcagtctttg 1260 
attgattata accatgaggc gacaagcttt ggcgtcggac ttaegctcaa cgaceggaeg 1320 
ggtctaeaa 1329 


<210> 4 
<211> 442 
<212> PRT 

<213> Moraxella catarrhalis 



<400> 4 

Met Lys Val Ser Leu Ser Thr Leu Thr Leu Ser Ile Leu Pro Cys Phe 
1 5 10 15 

Ala Ile Leu Ala Ile Gln Gln Ala Gln Ala Val Pro Asn Pro Val Ala 
20 25 30 

Phe Val Asp Glu Val Arg Ser Lys Asn Asp Leu Gly Gln Asp Asn Glu 
35 40 45 

Leu Leu Ile Gly Val Gln Ser Ala Thr Gln Ser Ala Ser Thr Asp Thr 
50 55 60 

Ala Asn Pro Leu Asp Glu His Glu Pro Glu Leu Tyr Thr Thr Ala Leu 
65 70 75 80 

Glu Asn Lys Thr Met Leu Ile Asn Cys Ser Ala Leu Asn Gln Asp Ile 
85 90 95 

Met Arg Leu Ala Cys Tyr Asp Thr Leu Val His Gly Glu Thr Pro Ala 
100 105 110 

Val Ile Lys Thr Lys Arg Ser Ile Arg Leu Asp Glu Thr Ile Trp Gln 
115 120 125 

Thr He Lys Gly Lys Pro Gin Val Val Tyr Gin Glu Thr Thr Asp Pro 

130 135 140 

lie Phe Leu Met Gly Asn Glu Lys Gly Met Leu Thr Lys Lys Asp Ala 
145 150 iss ISO 

Lys Gin Leu Glu Tyr Ala Ala Lys Gin Phe Thr Pro Leu Ser Leu Ser 

1S5 170 175 

Phe Asp Leu Asp Arg Asn Asn Thr Pro Leu Trp Ser Ser Arg Pro His 

180 18S 190 

Asn Pro Met Tyr Val Leu Pro He Phe Met His Gly Lys Pro Asn Arg 

19S 200 205 

Ser Pro Asn Thr Pro Ser His Glu Ala Arg Gin Phe Thr Pro Asn Glu 

210 215 220 

Phe Arg Ala Pro Glu Leu Lys Phe Gin Val Ser Val Lys Val Lys Ala 
225 230 235 240 

Ala Glu Asp Leu Trp Gly Thr Asp Ser Asp Leu Trp Phe Gly Tyr Thr 

245 250 2S5 

Gin Gin Ser His Trp Gin He Phe Asn Gly Lys Asn Ser Arg Pro Phe 

260 265 270 

Arg Val His Asp Tyr Gin Pro Glu He Phe Leu Thr Gin Pro Val Tyr 

27S 280 28S 

Ser Asp Leu Pro Trp Asp Gly Lys Val Arg Mec He Gly Met Gly Ala 

290 295 300 

Val His His Ser Asn Gly Glu Ser Ala Lys Leu Ser Arg Ser Trp Asn 
305 310 3is 320 

Arg Ala Tyr Leu Met Ala Gly Met Glu Trp Lys Asn Leu Thr Val Met 

325 33Q 33s 

Pro Arg He Trp Gly Arg He Phe Lys Glu Gly Ser Gly Ser Gin Pro 

340 345 3S0 

Asp Asp Asn Pro Asp He Leu Asp Tyr Tyr Gly Tyr Gly Asp Val Arg 

355 360 3S5 

Phe Leu Tyr Gin Leu Glu Asn Lys Ser Asn He Ser Gly Thr Val Arg 

370 375 380 

Tyr Asn Pro Arg Ser Gly Lys Gly Ala Leu Gin Leu Asp Tyr Val Tyr 
385 390 39S 400 

Pro Leu Gly Lys Gly Ile Ser Gly Tyr Phe Gln Ile Phe Gln Gly Tyr 
405 410 415 

Gly Gln Ser Leu Ile Asp Tyr Asn His Glu Ala Thr Ser Phe Gly Val 
420 425 430 

Gly Leu Met Leu Asn Asp Trp Met Gly Leu 
435 440 

<210> 5 
<211> 1329 
<212> DNA 

<213> Moraxella catarrhalis 
<400> 5 

acgaaagtcc cactgtctac attgacttta tctacettge cacgtttegc eatcctagcc 60 
attcagcaag caaaagctgt accaaatcct gcggcatttg ttgacgaagt acgcagtgaa 120 
aatgatcttg ggcaagacaa tgaaccaccc attgatgtcc aaagegcgac acaatcagcg 180 
tccactgata cggctaaccc ettagacgaa cacgaaccag agcCCtatac gacagcetta 240 
gaaaataaaa ccacgctgat taactgctca gcacttaaec aagacaecac gcgcecggcg 300 
tgctatgaca ctttggtgca tggtgagacg ccagcggtaa ttaaaaccaa gcgttccaee 360 
cgccttgatg aaacaacttg gcagaccaec aaaggcaaac cccaggtcgt ccatcaagaa 420 
acgacagatc cgattttttt aacgggtaat gaaaaaggca tgccgaccaa aaaagatgcc 480 
aaacagcttg aacatgcagc caaacagttt acaccaccga gcttaccaet tgatttagac 540 
cgaaataaea caccaccttg gncaccacga ccacacaatc cgatgcaege attgcccata 600 
cttatgcacg gtaagcctaa tcgaagccca aatacgccca gccatgaagc aagacaattt 660 
accccaaatg aatcccgtgc ccctgaatta aaatcccaag tccctgttaa ggtcaaagct 720 
gccgaggatt Cacgggggac ggatccagat ttatggtetg gatatacaca gcaatcgcac 780 
tggcagactt etaatggaaa aaacccccgt cctttcagag tacatgatta ccagccagag 840 
accctcttaa ctcaacctgc acacccagac ctaccacggg atggcaaagt ccgcacgatt 900 
ggcacgggcg cggtacatca ttccaacggc gaaagcgcca aactgtctcg ctcatggaae 960 
cgcgctcatt tgatggcagg catggaatgg aaaaacctga ctgtcacgcc acgcatccgg 1020 
gggcgcatct ttaaagaggg tagcggcagc cagccagatg acaaccccga taccttggac 1080 
tactacggtt atggtgacgn gcgtctttCa taccaaccag aaaacaaaag taatatctca 1140 
ggcacggtac gctataaccc acgctcaggc aaaggcgcgc cgcaactcga ctatgtctat 1200 
ccgcttggta agggaactag tggccatttt caaaeatttc aaggctatgg gcagtctttg 1260 
atcgattaca accacgaggc gacaagcctt ggcgccggac ttatgcccaa cgaccggatg 1320 
ggcccataa 1329 

<210> 6 
<211> 442 
<212> PRT 

<213> Moraxella catarrhalis 

<400> 6 



Mec Lys Val Ser Leu Ser Thr Leu Thr Leu Ser He Leu Ser Cys Phe 

1 S 10 is 

Ala lie Leu Ala He Gin Gin Ala Lys Ala Val Pro Asn Pro Val Ala 

20 25 30 

Phe Val Asp Glu Val Arg Ser Glu Asn Asp Leu Gly Gin Asp Asn Glu 

35 40 4S 

Leu Pro He Asp Val Gin Ser Ala Thr Gin Ser Ala Ser Thr Asp Thr 

50 55 60 

Ala Asn Pro Leu Asp Glu His Glu Pro Glu Leu Tyr Thr Thr Ala Leu 
SS 70 75 80 

Glu Asn Lys Thr Met Leu lie Asn Cys Ser Ala Leu Asn Gin Asp He 

85 90 95 

Met Arg Leu Ala Cys Tyr Asp Thr Leu Val His Gly Glu Thr Pro Ala 

100 105 HO 

Val He Lys Thr Lys Arg Ser He Arg Leu Asp Glu Thr He Trp Gin 

US 120 125 

Thr He Lys Gly Lys Pro Gin Val Val Tyr Gin Glu Thr Thr Asp Pro 

130 135 140 

He Phe Leu Met Gly Asn Glu Lys Gly Met Leu Thr Lys Lys Asp Ala 
145 ISO 155 ISO 

Lys Gin Leu Glu Tyr Ala Ala Lys Gin Phe Thr Pro Leu Ser Leu Ser 

165 170 175 

Phe Asp Leu Asp Arg Asn Asn Thr Pro Leu Trp Ser Ser Arg Pro His 

180 las 190 

Asn Pro Met Tyr Val Leu Pro He Phe Met His Gly Lys Pro Asn Arg 

195 200 205 

Ser Pro Asn Thr Pro Ser His Glu Ala Arg Gin Phe Thr Pro Asn Glu 

210 21S 220 

Phe Arg Ala Pro Glu Leu Lys Phe Gin Val Ser Val Lys Val Lys Ala 
225 230 235 240 

Ala Glu Asp Leu Trp Gly Thr Asp Ser Asp Leu Trp Phe Gly Tyr Thr 

24S 250 25S 

Gin Gin Ser His Trp Gin He Phe Asn Gly Lys Asn Ser Arg Pro Phe 

260 2S5 270 

Arg Val His Asp Tyr Gin Pro Glu He Phe Leu Thr Gin Pro Val Tyr 

27S 200 28S 

Ser Asp Leu Pro Trp Asp Gly Lys Val Arg Met He Gly Met Gly Ala 
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290 295 300 

Val His His Ser Asn Gly Glu Ser Ala Lys Leu Ser Arg Ser Trp Asn 
305 310 315 320 

Arg Ala Tyr Leu Met Ala Gly Mec Glu Trp Lys Asn Leu Thr Val Met 

325 330 335 

Pro Arg He Trp Gly Arg He Phe Lys Glu Gly Ser Gly Ser Gin Pro 

- 340 345 350 

Asp Asp Asn Pro Asp Ue Leu Asp Tyr Tyr Gly Tyr Gly Asp Val Arg 

3SS 360 3G5 

Phe Leu Tyr Gin Leu Glu Asn Lys Ser Asn lie Ser Gly Thr Val Arg 

370 375 380 

Tyr Asn Pro Arg Ser Gly Lys Gly Ala Leu Gin Leu Asp Tyr Val Tyr 
365 3 *° 395 400 

Pro Leu Gly Lys Gly He Ser Gly Tyr Phe Gin He Phe Gin Gly Tyr 

40S 410 41S 

Gly Gln Ser Leu Ile Asp Tyr Asn His Glu Ala Thr Ser Phe Gly Val 
420 425 430 

Gly Leu Met Leu Asn Asp Trp Met Gly Leu 
435 440 

<210> 7 
<211> 1329 
<212> DNA 

<213> Moraxella catarrhalis 
<400> 7 

atgaaagctt cactgtctac attgactcta tctattttgc catgttttgc catcctagcc 60 

attcagcaag cacaagctgt accaaatcct gtggcatttg ttgacgaagt acgcagtgaa 120 

aatgaccttg ggcaagacaa tgaattaccc attgatgtcc aaagtgcgac acaatcggcg 180 

tctactgata cggctaatcc tttagacgaa catgaaccag agctttatac gacagcttta 240 

gaaaataaaa ccatgctgat taactgctca gcacttaatc aagacatcat gcgtttggcg 300 

tgctatgaca ctttggtgca tggtgagacg ccagcggtaa ttaaaaccaa gcgttccatt 360 

cgccttgatg aaacaatttg gcagaccatc aaaggcaaac cccaggttgt ctatcaagaa 420 

acgacagatc cgattttttt aatgggtaat gaaaaaggca tgctgaccaa aaaagatgcc 480 

aaacagcttg aatatgcagc caaacagttt acaccactga gcttatcatt tgatttagac 540 

cgaaataata caccacttcg gtcaccacga ccacacaacc cgatgtatgt attgcccata 600 

tttacgcacg gtaagcctaa tcgaagccca aatacgccca gccatgaagc aaaacaatct 660 

accccaaatg aatctcgcgc ccccgagcta aaattccagg ttcctgttaa ggttaaagct 720 

gctgaggact cacgggggac ggactcagat ccatggttcg gatacacaca gcaaccgcac 780 
tggcagattt ttaatggaaa aaactctcgc ccttttagag tacatgacta ccagccagag 840 
attttcttaa ctcaacctgt acactcagac ttaccatggg atggcaaagt ccgcatgatt 900 
ggcatgggtg cggtacatca ttccaatggt gaaagtgcca aaccgcctcg ctcatggaat 960 
egtgettatt cgacggcagg catggaatgg aaaaacccga ccgtcacgcc acgcatttgg 1020 
gggcgtatct ttaaagaggg tagtggcagc cagccagatg ataatcctga tatcttggac 1080 
tattatggtt atggtgatgt gcgttttcta taecaaccag aaaataaaag taatatttca 1140 
ggcacggcac gctataatcc acgctcaggc aaaggcgcgc tgcaacttga ctatgtctat 1200 
ccgcttggta agggaattag tggctatttt caaatatttc aaggctatgg gcagtctttg 1260 
attgattata atcatgaggc gacaagcttt ggcgtcggac ttatgcttaa cgaccggacg 1320 
ggtctataa 1329 

<210> 8 
<211> 442 
<212> PRT 

<213> Moraxella catarrhalis 



<400> 8 

MeC Lys Val Ser Leu Ser Thr Leu Thr Leu Ser He Leu Pro Cys Phe 

1 5 io is 

Ala He Leu Ala lie Gin Gin Ala Gin Ala Val Pro Asn Pro Val Ala 

20 25 30 

Phe Val Asp Glu Val Arg Ser Glu Asn ; Asp Leu Gly Gin Asp Asn Glu 

35 40 4S 

Leu Pro He Asp Val Gin Ser Ala Thr Gin Ser Ala Ser Thr Asp Thr 

SO 55 fio 

Ala Asn Pro Leu Asp Glu His Glu Pro Glu Leu Tyr Thr Thr Ala Leu 
65 70 75 80 

Glu Asn Lys Thr Met Leu He Asn Cys Ser Ala Leu Asn Gin Asp He 

85 so 95 

Met Arg Leu Ala Cys Tyr Asp Thr Leu Val His Gly Glu Thr Pro Ala 

100 ios 110 

Val He Lys Thr Lys Arg Ser He Arg Leu Asp Glu Thr He Trp Gin 

US 120 125 

Thr He Lys Gly Lys Pro Gin Val Val Tyr Gin Glu Thr Thr Asp Pro 

130 13S 140 

He Phe Leu Met Gly Asn Glu Lys Gly Met Leu Thr Lys Lys Asp Ala 
US ISO iss 160 

Lys Gin Leu Glu Tyr Ala Ala Lys Gin Phe Thr Pro Leu Ser Leu Ser 
165 !7 0 175 
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Phe Asp Leu Asp Arg Asn Asn Thr Pro Leu Trp Ser Ser Arg Pro His 

180 185 190 

Asn Pro Met Tyr Val Leu Pro He Phe Met His Gly Lys Pro Asn Arg 

19S 200 205 

Ser Pro Asn Thr Pro Ser His Glu Ala Lys Gin Phe Thr Pro Asn Glu 

210 21S 220 

Phe Arg Ala Pro Glu Leu Lys Phe Gin Val Ser Val Lys Val Lys Ala 
22S 230 23S 240 

Ala Glu Asp Leu Trp Gly Thr Asp Ser Asp Leu Trp Phe Gly Tyr Thr 

245 250 255 

Gin Gin Ser His Trp Gin He Phe Asn Gly Lys Asn Ser Arg Pro Phe 

260 265 270 

Arg Val His Asp Tyr Gin Pro Glu He Phe Leu Thr Gin Pro Val Tyr 

275 280 285 

Ser Asp Leu Pro Trp Asp Gly Lys Val Arg Met He Gly Met Gly Ala 

290 295 300 

Val His His Ser Asn Gly Glu Ser Ala Lys Leu Ser Arg Ser Trp Asn 
305 310 315 320 

Arg Ala Tyr Leu Met Ala Gly Met Glu Trp Lys Asn Leu Thr Val Met 
325 330 335 

Pro Arg Ile Trp Gly Arg Ile Phe Lys Glu Gly Ser Gly Ser Gln Pro 
340 345 350 

Asp Asp Asn Pro Asp Ile Leu Asp Tyr Tyr Gly Tyr Gly Asp Val Arg 
355 360 365 

Phe Leu Tyr Gln Leu Glu Asn Lys Ser Asn Ile Ser Gly Thr Val Arg 
370 375 380 

Tyr Asn Pro Arg Ser Gly Lys Gly Ala Leu Gin Leu Asp Tyr Val Tyr 
385 390 395 400 

Pro Leu Gly Lys Gly He Ser Gly Tyr Phe Gin He Phe Gin Gly Tyr 

40S 410 41S 

Gly Gin Ser Leu He Asp Tyr Asn His Glu Ala Thr Ser Phe Gly Val 

420 425 430 

Gly Leu Met Leu Asn Asp Trp Met Gly Leu 
43S 440 

<210> 9 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer sequence 
<400> 9 

gatttaagag tatgttatga tg 22 

<210> 10 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer sequence 
<400> 10 

gtatgggttg accaaataca g 21 

<210> 11 
<211> 58 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 11 

aagggcccaa ctacgcagag gggatcceaa gctgtaccaa atcctgtggc atttgttg 58 

<210> 12 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<400> 12 

aagggcccaa tcacgcagag ggtcgaccca ctacagaccc atccagtcgt taagcataag 60 

<210> 13 
<211> 1000 
<212> DNA 

<213> Moraxella catarrhalis 

<400> 13 

acttggcgaa aeaccatte atatcgaeeg tgacgttata caggcagacg gcggtacacg 60 
cacagccagt atcagtggtg ctgcggcggc acecattgac gctttagaac acttgcagcg 120 
tcgcaaaaag cttacccaag aeccgcttet gggcccggtg gcagcggctt ctgcgggtgt 180 
taatcaaggc cgegtattgc ttgatttgga ttatgctgaa gaeecaactt gtgatacega 240 
tttaaatgcg gtcacgacgc aggcaggtgg gceeaetgag actcaaggca cagcagaaga 300 
aaagccatte actcgtgctg aagccaatgc gatgcccgat ttggcagagc cgggaattgg 360 
gcagatcatc gaagcccaaa agcaagtatt aggctggtga tatgccaacc gttgaagaca 420 
atggcgcgae catcacatta aacggacaag taaaagaccc attaeetegg tggtcgaega 480 
tattgctgct gctgggtgtc ttggtggcaa tcacttgctc gaetgcaccc gttetteatg 540 
caatcggtgc gttggcttta ttcgcagttg cggcattege gtttaatatt caaaggcaaa 600 
aagccaaaac ttgtcatatg ttttcacaag gtcgcttgaa gaccacgtcc aaacgctttg 660 
agactcataa caaatcacta accttatcag catcggcaac aatatccgct aaagacaaca 720 
aaacgacaat tgttgatcgg ggcattgaae atcattttac aggttccgct gacgaccgtg 780 
aaattaacac agccaaacag gtacttttgg gaaagtcaat caaaaccaat gcggtggcgg 840 
taacateggc taagtagttg ttgtgataca gacaggctgg atggtcttca actccaccca 900 
cccaactctt tcteegtttg gatttaagag tatgttaega cgggcaggat tttattttaa 960 
gtcaccactt aatgcaatea gctgtccaga gtagccgttc 1000 